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(57) Abstract 

This invention concerns binding domains e.g. single chain variable domains which are synthetic analogues of other single 
chain variable domains of members of an immunoglobulin family or superfamily. In the analogue, one or more amino acid resi- 
dues is altered as compared to the other domain, so that the analogue is more hydrophilic than the natural domain. The altered 
amino acid is substituted with a residue which occurs in an analogous position in a member of an immunoglobulin family or su- 
perfamily. This increased hydrophilicity means that the synthetic analogue will show less non-specific binding than the natural 
domain. The analogue may retain the binding specificity of the natural domain. Alternatively, the complementarity determining 
regions may be altered to change the binding specificity. The invention also concerns methods for making these binding domains. 
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BINDING DOMAINS 

The present invention relates to molecules 
5 comprising binding domains and methods for their 
preparation and use. In particular the present invention 
relates to molecules comprising domains which are 
synthetic analogues of a natural single variable domain 
of a member of an immunoglobulin family or superf amily . 

10 The present invention also relates to methods for 
designing said molecules comprising domains, the 
molecules so designed, and kits and methods for use of 
said molecules in therapy and diagnosis. 

Antibodies and other members of the immunoglobulin 

15 superf amily, such as T cell receptors, have the ability 
to recognise molecules eg. antigens, specifically and 
bind them with high affinity. In naturally occurring 
antibodies, the binding site for antigen is formed by the 
juxtaposition of variable (V) domains from both heavy (H) 

20 and light (L) chains. Within each of these chains are 
three stretches of amino acids, the complementarity 
determining regions ( CDRs ) , which comprise the residues 
which interact with antigens. The three CDRs alternate 
with four framework regions ( FR ) . Winter et al . 

25 demonstrated that a single V domain was able to bind 
antigen with high affinity and specificity [Ward et al .' 
Nature 341, 544-546 (1989)]. It was proposed that these 
single domain antibodies ( VH ) would have advantages for 
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several applications owing to their small size (relative 
. mo lecular mass C rmm ) = 13, 000 ) compared to whole antibodies 
(rmm=160,000) and other antibody fragments. 

However, VH domains have unique disadvantages that 
5 disappointingly limit their utility. The difficulties 
encountered reflect at least two properties of VH domains 
which may be related. They are expressed in low 
entities when cloned in bacteria (ca. 200pg/ml of 

culture supernatant compared to lOmg/ml for Fv f ragments.)- - - 

10 and during purification of VH domains, substantial 
counts of material are lost. Concentration of the VH 
single domain. for example using ultrafiltration and 
purification on chromatography columns, often leads to 
poor recoveries. This probably reflects non-specific 
15 binding to surfaces. This has been observed directly by 
ward et al. ( 1989) supra., who isolated a high 
proportion of single-domain antibodies exhibiting 
significant (and non-specific) binding to plastic. In 
v**,, non-specific binding in tissues would lead to poor 
performance in applications such as tumour imaging or 

cancer therapy studies. 

Tnese difficulties are no. generally experienced 
with whole antibodies or with fragments of antibodies 
such as Fv or Fab fragments. Therefore, the problems 
25 appear to be a characteristic of antibody fragments 
containing unpaired single domains. 

Thus, the present invention see*s to ameliorate any 
of the above or other problems associated with single 
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variable domain binding members, whilst retaining antigen 
binding for their full commercial exploitation. 

The present applicant has realised that the most 
likely cause of the unfavourable properties of single 
domain antibodies is the exposure to aqueous solvent, of 
the hydrophobic face of a single variable domain eg. the 
VH single domain. In native antibodies, this face 
interacts with the adjacent hydrophobic face of the VL 
domain and is buried within the antibody molecule. 
Exposure of the face would lead to strong interactions 
with, surfaces, for example chromatography matrices, from 
which material could not be recovered. Additionally, the 
present applicant believes that exposure of the face to 
aqueous solvent may lead to decreased stability of the 
- 15 single variable domain eg. the VH single domain, leading 
to turn to unfolding and loss of binding activity during 
purification procedures. Most importantly, this 
hydrophobic face is a potential source of non-specific 
binding,' which considerably limits the utility of these 
20 single variable domain molecules in vivo and in vitro. 

Although the VH single domain is quite small, its 
activity depends on a complex tertiary structure 
comprising interactions between disparate parts of the 
molecule. [Chothia, C. et al . J. Mol . Biol. 186, 651-663 
25 (1985)]. in several cases, there are side-chains in the 
framework which have important interactions with those in 
the antigen combining site. For example, in the anti- 
progesterone antibody D33, tryptophan 47 (framework) has 
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heen shewn « contact progesterone (Arevalo, J.H.. 
Taussig, M. and Wilson. I. personal communication), and 
-hydroxy vitamin K in the -hydroxy vitamin K-binding 
antibody IgGl NEW [Twining, S.S. and Atassi, M.Z. J- 
5 Biol. Chem. 253, 5259 (1978)]. In some antibodies, 
interactions of residue 71 ( framework ) with CDR residues 
have been shown to be important for the maintenance of 

rrhnthia c et al. Nature 342, 877-883 
antigen binding [Chothia, et <=• 

art, that any modifications to the basic antibody or 
variable domain structure will most likely have 
pleiotro D ic consequences on the tertiary structure of the 
antibody or variable domain and render them unable to 
hind antigen with the desired specificity and affinity. 
This is exemplified in recent experiments with anti- 
lysozyme single-domain antibodies. A single amino acid 
change, substituting Asn 35 with His, was found to 
improve the expression level in E.coli by about 1000 
fold. Critically however, the altered molecule was found 
to bind lysozyme only very weakly (E.S.Ward, L.Reichmann, 
G. P. Winter, personal communication). Because the 
interactions responsible for assembly of the active 
structure, particularly of VH domains, are not completely 
understood, any amino acid changes mace to the basic 
structure mu.t be selected extremely carefully- 
Selection of any changes is further complicated by the 
f .ct that the residues at the interface of the VH single 
domain with the V L domain, come from the CDEs as well ^as 
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the framework. The framework residues (for example, 
residues 37,39,45,47,91,93 and 103) at the interface of 
the anti-lysozyme antibody D 1.3 [Amit, A.G. et al , 
Science 233 747-753 (1986), Chothia, C. et al . , Science 
5 233, 755-758 (1986)] are highly conserved in antibodies 
of all species. They are invariant or, rarely, have 
substitutions which are usually conservative, i.e. 
replacement of the amino acid with one having similar 
chemical properties. For example, aliphatic hydrophobic 

10 residues are usually substituted by similar aliphatic 
residues. Thus, if substitutions are made in these 
framework residues, there is a strong risk of disrupting 
the structure of the molecule and hence binding of 
antigen by the antibody or single variable domain eg. VH. 

15 The CDR residues vary between antibodies and determine 
the specificity and affinity of binding to antigen. 
Since the applicant wishes to retain the capacity to bind 
a variety of antigens with high affinity and specificity, 
the ability to vary these CDR residues needs to be 

20 retained. 

The present invention therefore provides a molecule 
comprising a binding domain having a polypeptide sequence 
which is an analogue of part or all of a naturally 
occurring molecule comprising a binding domain having 

25 specificity for a particular binding member, in which 
analogue one or more amino acids are altered to reduce 
the hydrophobicity of said analogue as compared to the 
natural binding substance. 
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The analogue may have substantially the same binding 
characteristics as the natural binding substance eg. with 
res.ect to specificity, affinity, or avidity. In some 
caseS these characteristics may be improved. The 
nation may be any amino acid alteration which reduces 
hydrophobics of the polypeptide sequence eg. amino acid 
substitution, deletion or addition. 

The molecule comprising a binding domain may 
^prise—a— embody" or" ^h^--receptor-molecule-_and_. 



10 



comprise an au^-«-j 

laments and derivatives of antibodies and receptor 
ra olecules. in particular, the molecule may comprise a 
single variable domain of the type that is present in an 
an tibody molecule. The alteration may be in a 
complementarity determining region and/or in a framewor* 
15 region. Preferably, the alteration is in a framewor* 
region. Where the alteration reducing hydrophobic^, x- 
in a framework region, the complementarity determining 
regions may also be altered by way of amino acid 
suhstitution, deletion, addition or inversion to alter 
20 the specificity and or binding characteristics of the 

binding substance. 

The molecule comprising a binding domain may 
comprise any one or more of the alterations described in 

examoles 2 to 11 - 

Thus the present invention provides a single chain 
va . ia bie domain which is a synthetic analogue of another 
single chain variable domain, of a member of an 

- - _ ^ it- = t-^h in which 

immunogloouiin rami-y 
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analogue, one or more interface amino acids residues of 
the domain is altered as compared to the said another 
domain, in which a said altered amino acid is substituted 
with a residue which occurs in an analogous position in a 
5 member of an immunoglobulin family or superfamily, so 
that the analogue is more hydrophilic than the said 

another domain . 

The altered amino acid residue may be in a framework 
region. The altered amino acid residue may be in a 

10 complementarity determining region. The synthetic 
analogue may have essentially the same binding activity 
as the said another domain. The sequence of a 
complementarity determining region may be additionally 
altered by way of amino acid substitution, deletion, 

15 addition, or inversion, to alter the specificity and/or 
binding characteristics of the analogue as compared to 
the said another domain. 

The single chain variable domain may be a synthetic 
analogue of a single variable immunoglobulin heavy chain 

20 domain. In which case, the one or more of the amino acid 
residues 37,39,45,47,91,93 and 103 may be- altered. The 
amino acid alterations may comprise one or more of the 
following: 

i ) substitution of valine 37 with glutamine or 

25 threonine; 

ii). substitution of glutamine 39 with glutamate; 

iii) substitution of leucine 45 with glutamine; 

iv) substitution of tryptophan 47 with aspartate or 
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vii ) 



viii ) 



glycine; 

v) substitution of tyrosine 91 with threonine, 
serine or methionine; 

vi) substitution of alanine 93 with serine or 

glutamate; 

substitution of tryptophan 103 with glutamate 
or tyrosine or threonine; 

substitution of valine 37, leucine 45, tryp- 
tophan 47, alanine 93 and/ or" tryptophan 103 
with any of asparagine, threonine or serine ; 
ix) substitution of valine 37 with threonine and 
glutamine 39 with glutamate and tryptophan 47 
with glycine; 

X ) substitution of tyrosine 91 with serine or 
methionine and alanine 93 with glutamate and 
tryptophan 103 with threonine. 
The single chain variable domains according to the 
present invention may be coupled to a further molecular 
moiety. The further molecular moiety may be an enzymic-, 
fl uorescent-, or radio-label, or a portion of an 

immunoglobulin . 

The present invention also provides diagnostic kits 
which comprise a single chain variable domain as 
described above together with one or more ancillary 
reagents for carrying out the diagnostic test. 

The present invention also provides therapeutic 
comoosition which comprises at least a single chain 
variable domain as described above. The composition may 
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also comprise one or more excipients . 

In an aspect of the invention there is provided 
single chain variable domains eg. VH single chain domain 
frameworks, with improved properties as indicated above 
5 and which allow the generation of specificities for 
various binding partners by substitution of different 
CDRs into the new framework. The applicant also provides 
that the framework residues of molecules comprising 
single domains previously isolated are substituted to 

10 make them more polar. The modified molecule should 
retain the ability to bind the desired antigen. 
Preferably, the changes would not render the interface 
immunogenic when administered to humans. 

The present invention also provides a method for 

15 making a single chain variable domain which is a 
hydrophilic synthetic analogue of another single chain 
variable domain of a member of an immunoglobulin family 
or superfamily, which comprises: 

(i) inspecting the interface regions of a said single 
20 chain variable domain to identify hydrophobic amino acid 

residues ; and ■ 

( ii ) producing a said analgue of said single chain 
variable domain in (i) in which one or. more of said 
hydrophobic residues is substituted with a less 

25 hydrophobic residue which occurs in an analogous position 
in a member of an immunoglobulin family or superfamily. 
The method may comprise: 

(a) obtaining the nucleoside sequence encoding one 
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or more of the identified hydrophobic amino 

acid residues; 

(b) using site directed mutagenesis to alter the 
nucleotide sequence to introduce a triplet 

5 coding for the substitute amino acid, 

(c) using the altered nucleotide sequence in a 
recombinant expression system to express the 
synthetic analogue. 

i^^^^^^^^^^^ 3 ^ " may " " 

X0 be substituted. The substitute amino acids may be 
derived from naturally monomeric members of the 
Immunoglobulin superfamily The naturally monomeric 
.ember may be Thy-1- The synthetic analogue may have 
essentially the same binding activity as the said another 

15 domain. 

„ h ere the molecule comprising a binding domain is an 
immunoglobulin, or a fragment or derivative of an 
Immunoglobulin, an amino acid site suitable for 
alteration may be identified by: 
20 i} examining the molecule for hydrophobic amino acids 
ejected to be on the surface of the binding substance, 
ii, more soecifically examining the amino acid residues 
.elated to those expected to be buried at the interface 
of the heavy and light chain domains of an 

25 immunoglobulin, 

HI, examining those residues identified In (ID that 
would be exposed to the solvent when the single domain xs 
used separately. 
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An appropriate alteration to make may be identified 
with reference to the homologous amino acid sequences of 
members of a family of related substances. For example, 
the amino acid sequence of a molecule at the site for 
5 alteration identified as described above, may be altered 
to make it homologous at that site in one or more members 
of the family group. 

The family of related substances may comprise a 
family of immunoglobulins, fragments and derivatives 
10 thereof. Alternatively, the family of related substances 
may comprise a family of proteins containing domains 
structurally related to the immunoglobulins ie. an 
immunoglobulin superfamily. 

The nucleotide sequence may be altered by site 
15 directed mutagenesis using oligonucleotides designed to 
introduce the required alterations. Alternatively, the 
alteration may be achieved by use of the technique known 
as polymerase chain reaction. 

The present invention also embraces kits having 
20 molecules comprising binding domains as herein provided. 
The kits may be diagnostic, purification or catalysis 
kits . The present invention further embraces 
pharmaceutical products which contain molecules 
comprising binding domains according to the present 
25 invention. 

TERMINOLOGY 
Binding Domain 

This describes an area or. rhe surface of a protein., 
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15 



20 



12 



25 



or B cavitv which specifically binds to, and is therefore 
defined as co^ntary wit, a particular spatial and 
polar organisation of another molecule. A domain is 
fol ded within itself and independently of other parts of 
the same Protein and independentXy of a complementary 

binding member. 
Immunoglobulin 

This describes a group of structurally related 
prot eins consisting of two pairs of heavy polypeptide 
chains and two pairs of light: polypeptide chain, all 
linked together by disulphide bonds. They have a binding 
domain for another molecule such that a given 

• - • ~ _ i i v i-o -that another 
immunoglobulin binds specxrxcally to 

molecule. 

Th e protein can be natural or partly or wholly 
synthetically produced. The term also covers any protean 

= -t, which is homologous to an 
having a binding domain whicn 

immunoglobulin binding domain. 

Herein the numbering of immunoglobulin amino ac.d 
r esidues is according to Kabat E . A . et al. in "Seguences 
of Proteins of Immunological merest" U.S. Department or 

Health and Human Services 1937. 
Antibody 

Th is describes an immunoglobulin whether natural or 
P a«lv or wholly synthetically produced. ■ The term also 
covers anv protein having a binding domain which x. 
homologous to an immunoglobulin binding domain. These 
osteins can be derived from natural sources, or partly 
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or wholly synthetically produced. 

Example antibodies are the immunoglobulin isotypes 
and the Fab, F(ab 1 ) 2 , scFv, Fv, dAb, Fd fragments. 
Immunoglobulin Superfamily 
5 This describes a family of polypeptides, the members 

of which have at least one domain with a structure 
related to that of the variable or constant domain of 
immunoglobulin molecules. The domain contains two p- 
sheets and usually a conserved disulphide bond (see A.F. 
10 Williams and A.N. Barclay 1988 Ann. Rev Immunol. 6 381- 
405. 

Example members of an immunoglobulin superfamily are 
CD4, platelet derived growth factor receptor (PDGFR), 
intercellular adhesion molecule. (ICAM). Except where 
15 the context otherwise dictates, reference to 
immunoglobulins and immunoglobulin analogs in this 
application includes members of the immunoglobulin 
superfamily and analogs thereof. 
Interface 

20 This describes the region on a given heavy or light 

chain of an immunoglobulin which associates with the 
complementary heavy or light chain. 
Framework 

Each chain of an immunoglobulin has a constant (C) 
25 and a variable (V) region. Each V region is made up from 
three complementarity determining regions ( CDR ) separated 
by four framework regions ( FR ) . The CDRs are variable 
stretches of amino acid sequences and provide the 
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function of binding to another molecule. It is the 
possibility of variability which provides immunoglobulins 
with various binding specificities. The F*. are 
substantially constant stretches of — acid sequences 

5 which interpose the CDRs. 

In order that the present invention is more fully 
understood, it will now be described in more detail, 
firstly in general outline and secondly with reference to 
--^--^^^ 

^ . . .„ +i . n The following description 
and not by way of limitation. The ro 

refers to the figures in which: 

Fig ure 1 shows the nucleotide and amino acid sequence of 
, he VH domain of the anti-lyso.yme antibody D1.3 cloned 
in pUCH9; 

Fig ure 2 shows mutant oligonucleotides for substitution 
o£ residues found in naturally occurring immunoglobulin 

heavy chains; 

Fig ure 3 shows mutant oligonucleotides for substitution 
with homologous residues from Thy-1; 

Fig ure 4 shows mutant sequences obtained by substitutions 
of Thv-1 residues into VHD1.3; 

Fiau ; e 5 shows oligonucleotides for the random 
substitution of asparagine, serine or threonine into 
VHD1 . 3 ; 

Fig ure 6 shows a graph illustrating lyso^yme binding 
actWitv of TGI (control). VHD1.3. VHTKY-1 and VKTHV-2: 
Figure 7 shows a graph illustrating lysozyme bincmg 
ac ^^tv of OUC11.9 (control), VHD1.3. VHMutTrp , VHMutLeu 
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and VHThy-3; 

Figure 8 shows a graph illustrating lysozyme binding 
activity of pUC119 (control), VHD1 . 3 , VHThy-1, Thy-2, 
VHThy-1, Thy-3 and VHMutWD; 
5 Figure 9 shows the nucleotide sequence of the vector 
fdPs/Bs around the cloning site and indicates the Pstl 
and BstEll restriction sites; and 

Figure 10 shows a graph illustrating lysozyme binding 
activity of vector fdPs/Bs (control), f dVHThylThy2 , 

10 fdVHD1.3 and phage antibody D1.3. 

The applicant has devised three related strategies 
for the selection of alterations to the frameworks. This 
invention enables the generation of antibodies and single 
variable domains eg- VH domains with improved properties 

15 for in vivo and in vitro use. 

1 . Substitution with VH Interface Res idues Found in 

Naturally Occurring Immunoglobulin Heaw Chains 

There is considerable amino" acid sequence homology 
between different immunoglobulins. Homologies are 

20 detected by lining up different sequences one above the 
other, and sliding the chains along relative to one 
another, until the best level of identity between the 
different sequences is arrived at. These analyses are 
generally performed on a computer. As noted above, the 

25 framework residues are highly conserved, ie. particular 
amino acids will be present in the same positions in a 
series of different antibodies. Rare substitutions do 
occur, however, and the applicant searched for naturally- 
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occurring substitutions of VK interface rescues. 

was perform by reference to readily available 

compilations of antibody sequences [e.g. Kabat, E.A. - 

Proteins of Immunological Interest 
al in "Sequences of prowina 

OS D. P =«„.« Of H»l« *-» S '" iC " 11967,1 ' 

Thi , ,„iy Si . " ^ Pt>Sl "°" " 

>e ««.!«-. » «« """" r -"— ^ n "°" lly 



reaiiseu ^ 

LU d omain structure. These natural substitutions most often 
occur once per antibody molecule. However, the applicant 
also combines the available substitutions from several 
an tibodies, together in tbe same molecule. I* x- 
possible that different combinations of natural 
IS substitutions can be used. 
2 sub^titutio,^^ 
a^e^omo^^ 
T,,m 0 alob>^" suoerfamily 

Of particular interest, are proteins which have 

• rnnta ining the immunoglobulin fold (a tertiary 
20 domains containing 

stru cture characteristic of antibody domains that has 
be en found in other proteins), bu, which do not associate 

with another domain. 

Ex amoles of molecules which contain a single domain 
25 homologous to immunoglobulin variable domains include: 
Thyl p 0 m velin, CD7 , CD28 and CTLA-4 [Williams, A.F. ana 
Barclay. A.N. Ann. Rev. Immunol. 5, 331-405 (1988)]. 

otbe . croteins contain more rhan one unpairec. 
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antibody-like domain. CD4 and MRC OX-2 each contain N- 
terminal domains homologous to variable domains of 
antibody molecules (*V'-type domains, one in MRC 0X2, two 
in CD4 ) , and a C-terminal domain homologous to constant 
5 domains of antibody molecules ('C'-type domain, one each 
in MRC 0X2 and CD4 ) [Williams, A.F. and Barclay, A.N. 
Ann. Rev. Immunol. 6, 381-405 (1988)]. 

The similarity in tertiary structure between these 
unpaired domain proteins and the domains comprising 

10 antibodies is . reflected to some extent in homologies at 
the amino acid sequence level . Amino acid homologies are 
assigned in much the same way as in 1 above, although the 
alignment of sequences in this case can be problematic. 
For Thy-1, the alignment with VH residues 37,39,91 and 93 

15 is relatively straightforward. However, VH residues 45 
and 47 have been published in two different alignments 
with Thy-1 [A.F. Williams and J. Gagnon, Science 216 696- 
703, 1982; A.F. Williams and A.N. Barclay, Ann. Rev. 
Immunol. 6 381-405, 1988]. Exploration of alternative 

20 ' substitutions may be necessary to identify the most 
appropriate, in cases where there is more than one 
possible alignment. 

There is a possibility that the modified VH 
interface exposed in the VH single domain would be 

25 antigenic in humans. This would be a disadvantage for in 
vivo therapeutic use. Substitution with residues which 
occur naturally in human Thy- I may reduce this potential 
antigenic response. 
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_ - oo, l p> t~ Amino Ac ids at; 
3. ^ m ± -Rando m insertion n* Polar win 

Interface Residues 

This strategy enables the improvement of single 
domains using a less direct method. Oligonucleotides for 
5 mutagenesis are synthesized with a mixture of bases xn 
some positions to give ambiguities for amino acid 
insertion at certain triplets. For example, the present 
applicant has designed a strategy which will allow the 

ra nd 0 m-in S ertion..of_the^^^ 

serine or threonine by the use of codons with ambiguities 
at the second position. These residues could be inserted 
tor' instance at the interface positions 37,45,47,93 and 
103 of the VH domain. The 243 possible framework 
resulting from this mutagenesis can then be screened to 
identify which of the semi-random combinations have the 
aesired properties. Strategies for screening the 
resulting semi-random population would include estimation 
o* antigen binding affinity and non-specific binding by 

ELISA (see below). 

This is one example of a number of strategies which 
can be used to vary interface residues in a semi-random 
manner without recourse . to the strategies outlined in 1) 
and 2) above. 

The present invention allows the binding affinity 
and soecificity of identified antibodies to be 
incorporated into single domain molecules with improvec 
properties for in vivo and in_vitro applications. 

The frameworks generated using the model sy«a» 



20 
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using the VH domain of antibody D1.3 (VHD1.3) can be used 
as frameworks for antibodies of any specificity by 
replacement of part(s), or all, of its CDRs with those 
from an antibody molecule of desired affinity and 
5 specificity. There are numerous ways in which this can 
be accomplished. For instance, following the 
determination of the sequence of the CDRs of the antibody 
of desired properties (eg., binding specificity), an 
oligonucleotide or series of oligonucleotides is 

10 synthesized which encodes these CDRs and the framework 
regions containing the nucleotide substitutions necessary 
to make the antibody/ antibody domain more polar. This 
oligonucleotide can then be amplified using PCR, cloned 
into a suitable vector such as pUC119, and the product 

15 expressed in bacteria. 

Alternatively, the relevant changes at the interface 
can be introduced into an existing single domain antibody 
' of desired specificity to improve the properties of that 
antibody. This can be achieved by a variety of methods, 

20 for example by site-directed mutagenesis, or PCR (for 
example using the method of Hemsley, A. et al . [Nuc. 
Acids Res. 16. 6545-6551 (1989)]. 

In addition to this, the improved single domain 
antibodies show reduced non-specific. The properties of 

25 the single chain variable domains provided hereby enable 
selection of single domain antibodies of the desired 
specificity and affinity, for example, using single 
domains cloned into fd phage. The frameworks described 
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in «i. i»« oould t. =1— 1»- » — 

.„istin g a» -Pi— * «■ ° £ «— » ' 

ne „ popul.tion of do»*i„ .ntisody n,oi.= U l«« th.t 

c „ „e screened for the de.ir.d binding 

5 ^temetivelv. « — — » " rtlDOal * , *" 

„e isolated described * — « - < 1989 ' *** 

,„»«. "> a£fi "" Y 7 

specified. «- «- «— - " 

^r-^^^^^^-- 1 --^-- 

» „„ end ins.-t.d into single domain, cloned into Id 

10 invention ana xnse- u«= 

pMg .. Bendo. ..t.g.n.sis of CDKs ~n «.„ be 

perfot-.d end ..Lotion of bodies of «- d.sited 

* cnP^ficity performed using affinity 
affinity and specificity P 

methods ♦ 

Espies 15 an, 16 herein snow that a derivative ox 
VHD1.3 • -re pol« -amewor, «» - diSPl - ed ^ 

phage as a'fusion with g- XXX protein (McCafferty J. et 
al 1990 Nature 348 P 552-554, wit, retention of Ending 
activity Display on phage would enable the generation 
20 of colons of substitutions at residues *y 

semi-random mutagenesis procedures (an example of whxch 
is gi ven in example 11) and the subseguent selection of 
those with favourable binding properties.- 

Since the improved single domain antibodies of the 
25 present invention constitute superior versions of 

^ • -? oc- th^v can be used 
_ . c-v na io domain antibocies , -n_y 

in manv of the ways as are immunoglobulins (Ig> and tne.r 

sup erfamily of molecules or fragments. For example, xg 
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molecules have been used in research, therapy (e.g. 
cancer therapy, modulation of immune status and therapy 
of diseases caused by pathogens), diagnosis (for example, 
estimation of hormonal status), in modulation of 
5 activities of hormones or growth factors, in detection, 
in biosensors, in catalysis, in purification of other 
molecules and in screening regimes for therapeutic 
compounds in the pharmaceutical industries. The lower 
non-specific binding of the improved single domain 
10 antibodies should prove especially useful for the above 
applications . 

The increased hydrophilicity may be of particular 
importance for their use as binding molecules in affinity 
chromatography, especially weak affinity chromatography 

15 (Zopf, D . and Ohison, S., Nature 346 87-89, 1990). 

Anti-idiotypic improved single domain antibodies can 
also be made. Anti-idiotypic specificities [Methods 
Enzymol. 178 . J.J. Langone ed. Academic Press (19 89)] are 
made in a two-stage process. Firstly, antibody A 

20 directed against a particular antigen or epitope is 
itself used to raise other antibodies. A proportion of 
the anti-A antibodies, antibodies B, will be directed 
against the antigen combining sites of antibody A, such 
that the antigen combining sites of B are complementary 

25 to that of A. In effect, the antigen combining site of 
antibody 3, the anti-idiotype, mimics in structure the 
original antigen or epitope recognised by antibody A. 
The original antigen can be a protein or any other 
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co » D oua d , for example a ca^te or a steroid, and 
the anybody used a, any stage in the procedure could be 
an improved single domain antibody. The final anti- 
idiotype antibody can be an improved single domain 
5 antibody produced as described herein, or a molecule of 
th. immunoglobulin superfamily fro- which the ant,- 

w = i are -transferred into an improved 
idiotypic determinant s) are trans 

single domain antibody framework. 

such anti.-idiotypic molecules_ are advantageous in a 

0 varietv of applications [Methods Enzymol. 178. J.J. 
L angone ed. Academic Press (1989,]- These include 
vaccines for treating cancers and diseases caused by 
bacteria, viruses and parasites. They may be used for 
locking cellular receptors for the aforementioned 
L5 pathogens as well as blocking cellular receptors for 
h ormones. They may also be advantageous in diagnostic 
procedures,, for example in place of antigen or peptide in 
ELISA. Anti-idiotypic specificities are known to be 

^-■^=1 ,Trin?tries [Methods Enzymol. 
useful in the pharmaceutical industries l 

20 178. J- J- Langone ed. Academic Press (1989)]. 

_ . „i =-t-oc- to improved single 
The oresent invention relates to in-? 

domain antibodies and receptors derived fro. molecules of 
the immunoglobulin ( I B ) superfamily, methods for 
selecting and effecting said improvements, and methoas 
25 for and kits use of said antibodies or receptors in 
rss earch. therapy, diagnosis, purifications, catalysis 
and discovery of novel therapeutics. 
^^^E^^PRE^ 
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FRAMEWORK BY SUBSTITUTION OF VALINE 37 WITH GLUT AMINE 

Figure 1 shows the nucleotide and amino acid 
sequence of the pUC119 VHD1.3 clone used for mutagenesis 
studies (Ward et al, 1989, supra). The amino acid 
5 residues in the VH domain that interact with the VL have 
been identified as 37,39,45,47,91,93 and 103 (Amit et al, 
(1986) supra, Chothia, C. et al (1986) supra). Amino 
acids occurring naturally in VH domains were surveyed 
using a compilation of immunoglobulin sequences [for 
example, Kabat, E.A. et al. in "Sequences of Proteins of 
immunological Interest" U.S. Department of Health and 
Human Services (1987)]. Substitutions have been found at 
positions 37,39,45,47,91,93 and 103 in naturally 
occurring heavy chains. The most polar substitutions at 
each of these residues in antibodies sequenced to date 
were chosen for replacement of interface residues by 
mutagenesis in the following examples. 

Residue 37 is valine in 385 out of 434 sequences 
surveyed. In 48 other sequences, an aliphatic amino acid 
was substituted. In the single remaining example, a case 
of human heavy chain disease, a significantly more polar 
residue, glutamine, was substituted. An oligonucleotide 
was designed for incorporation of this change in VHD1.3 
( VHMUTVAL , shown in figure 2). This mutation and the 
other mutations described in the examples below can be 
assembled in different combinations to make further novel 
derivatives . 
In Vitro Mutagenesis 



15 



20 



25 
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(1) The oligonucleotides detailed in fig. 2 were 
synthesised on an Applied Biosystems 391 DNA synthesiser 
and purified on Urea-acrylamide gels using standard 
techniques [Sambrook, J. et al.'in "Molecular Cloning: a 

5 laboratory manual (second edition)". Cold Spring Harbor 
Laboratory Press, 11.23] prior to in vitro mutagenesis. 

(2) Preparation of single stranded DNA template for 
mutagenesis. The VH D1.3 antibody gene (Ward et al, 

1989, supra) used to exemplify this invention was carried 

10 on plasmid pUC 119 (Sambrook, J. et al. in "Molecular 
Cloning: a laboratory manual (second edition)". Cold 
Spring Harbor Laboratory Press 1.14). Single-stranded 
template DNA was prepared by infecting TGI cells carrying 
the plasmid with M13 K07 helper phage using standard 
15 techniques for growth and purification (Sambrook, J. et 
al. in "Molecular Cloning: a laboratory manual (second 
edition)". Cold Spring Harbor Laboratory Press 4.46). 

(3) site-directed mutageneses were performed using the 
"in vitro Mutagenesis system. Oligonucleotide directed 

20 (version 2)" (Amersham International) exactly as per 
manufacturers instructions. Ampicillin-resistant 
colonies resulting from the mutageneses were grown 
overnight in 2YT (2YT=per litre of water, 16g Bacto- 
tryptone, lOg Yeast extract, 5g NaCl ) containing lOOug/ml 

25 ampicillin. These cultures were diluted in fresh 2YT and 
single stranded template DNA prepared by Ml 3 K07 
infection as in (2) above. Mutants were verified by DNA 
sequencing using the Sequenase version 2.0 kit. 
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EXAMPLE 2 PREPARATION OF VH Dl . 3 WITH A M ORE POLAR 
FRAMEWORK BY SUBSTITUTION OF GLUT AMINE 39 WITH GLUTAMATE 

Using similar logic and methods to those described 
in example 1, an oligonucleotide VHMUTGLN (fig. 2) was 
5 designed for the introduction of a glutamate residue at 
position 39. This substitution is found in 1 of 420 
heavy chains surveyed. Glutamate is considered to be 
marginally more polar than glutamine [Rose et al, Science 
229 834-838, 1985] . 

10 EXAMPLE 3 PREPARATION OF VH D1.3 WITH A MORE POLAR 
FRAMEWORK BY SUBSTITUTION OF LEUCINE 4 5 WITH GLUTAMINE 

Using similar logic and methods to those described 
in example 1, an oligonucleotide VHMUTLEU (figure 2) was 
designed which introduces a substitution of leucine with 

15 glutamine at residue 45 (fig. 2). This modification is 
found in 2 of 402 sequences surveyed (396 have leucine at 
this position). One of the antibodies containing 
glutamine at this position is a mouse antibody specific 
for anti-Bl, 6D-galactan. 

20 EXAMPLE 4 PREPARATION OF VH Dl . 3 WITH A MORE POLAR 
FRAMEWORK BY SUBSTITUTION OF TRYPTOPHAN 47 WITH ASPARTATE 
Using similar logic and merhods to those described 
in example 1, an oligonucleotide VHMUTWD (figure 2) was 
designed for the introduction of . aspartate at position 

25 47. This substitution is found in one of 392 heavy- 
chains surveyed. 

EXAMPLE 5 PREPARATION OF VH D1.3 WITH A MORE POLAR 
FRAMEWORK BY SUBSTITUTION OF TYROSINE 91 WITH THREONINE 
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Using similar logic and methods to those described 
in example 1, an oligonucleotide VHMUTTYH was designed 
for the introduction of threonine at position 91. This 
substitution is found in one of 398 heavy chains 
surveyed . 

* PREPARATION QJ^ ^^L^^^ 
ttt? &MKWORK BY c^TTTUTION OF AT.SNT NE 93 WITH SERINE 

Using similar logic and methods to those described 
in example 1, an oligonucleotide VHMUTALA (figure 2) was 
designed for substitution of alanine 93 with serine. 
This substitution is found in 4 of 410 heavy chains 

« «<p j- hP cp is in a mouse anti-B2,l 
surveyed. One of these is 

fructosan . 

cYaMPT.P! 7 PREPARXTTHM OP VH M . 3 WITH A MORE_POLAR 
ttp a MP!WORK BY — — " F TRYPTOPHAN 103 WIT H 

GLUTAMATF OR TYROSINE 

Using' similar logic and methods to those described 
in example 1, two oligonucleotides VHMUTTRP and VHMUTWY 
(figure 2) were designed for the introduction of 
giutamate and tyrosine respectively at position 103. 
T nese substitutions are found once each in 308 heavy 
chains surveyed. Glutamate is much more polar than 
tryptophan. Tyrosine, although more polar than 
tryptophan is a more conservative substitution. 

H PREPARATION OF VH D1.3 WITH_A_JlpRE__POLAR 
_ Linr . rr .. ^ co^THN OF RESIDUES IN_P0SITTgNS_^39 

AND 47 WITH HOMOLOGOUS Rr.SIDU^ r ' L — — ' 

FAMILY PROTEIN THY- 1 
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Thy-1 is a single domain protein of the immunoglobulin 
superfamily. Alignment of the residues of Thy-1 with 
those of immunoglobulin heavy chains has been performed 
by Williams ( A- F. Williams and J. Gagnon Science 216 696- 
5 703 1982; A.F. Williams and A.N. Barclay Ann. Rev. 
Immunol. 6 381-405, 1988). Although residues 37,39,91 
and 93 of the VH domain interface were aligned with the 
same residues of Thy-1 in both publications, the residues 
at positions 45 and ' 47 were aligned with different 
10 residues reflecting the lower degree of homology of 
adjacent amino acids from these positions. The 
oligonucleotide VHTHY-1 (figure 3) was designed to 
incorporate the most polar residues at positions 37,39 
and 47 found at homologous Thy-1 positions using the 
15 alignment published by Williams & Gagnon (1982), supra). 
Mutagenesis was performed as in example 1. The amino 
acid substitutions generated are shown in figure 4. 
EXAMPLE 9 PREPARATION OF VH Dl . 3 WITH A MORE POLAR 
FRAMEWORK BY SUBSTITUTION OF RESIDUES IN POSITIONS 91,93 
20 AND 103 WITH HOMOLOGOUS RESIDUES FROM THE IMMUNOGLOBULIN 
FAMILY PROTEIN THY-1 

Using the strategy described in example 8, an 
oligonucleotide VHTHY-2 (figure 3) was designed to 
incorporate the most polar substitutions of VH residues 
25 91,93 and 103 at homologous Thy-1 residues [Kabat, E.A. 
et al . in "Sequences of Proteins cf Immunological 
Interest" U.S. Department of Health and Human Services 
(1987). Another oligonucleotide VHTHY-3 (figure 3) was 
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designed to incorporate the residues found at these 
positions in rat brain Thy-1 which appears most polar 
overall at residues homologous to the VH interface. 
Mutagenesis is as described in example 1. The amino acid 
changes generated are shown in figure 4. 

r~^r* in PREPARATION OF ^ 01 3 WITH A MORE POLAR 
:=2 ^M RY SUBSTIT UTION OF RFSTniTFS IN POSITIONS 
■^T^- ^ .«n WTTH HOMOLOGOUS RESIDUES FROM_THE 

-TMMIINOGLOBt.TT.TN. .FAMILY PROTEIN THY-1 

The amino acid changes detailed in examples 8 and 9 
were combined by performing a mutagenesis experiment as 
in example 1 using the DNA sequence of mutant protein 
VHTHY-1 as template and mutant oligonucleotides VHTHY-2 
and VHTHY-3 (figure 3) to incorporate site directed 
changes . 

CT .mpt.« XI PREr— - V" °l-3 «"H A MORE POL AR 

^ BV SUBSTITUTION OF RESIDUES IN POSITIONS 

v 4S.47.~93 WITH AS ^^^TME. THREONINE OR SERINE 

Oliaonucleotides are prepared containing the triplet 
G XT at each of the positions 37,39,43,47,91,93 and 103 
(where X is a random mixture of the bases C,G and T; 
figure 5). Use of these as mutagenesis primers as in 
examole 1 would generate the insertion of Ser, Thr and 
Asn respectively depending on which base was 
incorporated. Derivatives generated are then screened 
for antigen binding and improved properties. 

i ? . ASSESSMENT OF ANTIGEN ENDING STATUS OF 
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VH D1.3 interface mutants VHThy-l and VHThy-2 
constructed as described in example 8 and 9 were assessed 
for lysozyme binding activity. Antigen binding status of 
mutant single domain antibodies was determined by ELISA 
5 (Enzyme Linked Immuno adsorbent Assay) according to 
techniques well known in the art. 

This is just one of a whole range of methods that 
can be used to measure antigen-antibody binding. Others 
include Western blotting, competitive radio immunas say and 
10 fluorescence quench. 

The ELISA for lysozyme binding by mutant single 
domains was undertaken as follows: 

Overnight cultures of ampicillin-resistant clones 
were diluted 1 in 10 into fresh 2YT (with lOOug/ml 
15 ampicillin) and grown for lhr at 37 a C. Isopropyl p-D- 
thiogalactopyranoside (IPTG) was added to ImM final 
concentration and the cells cultured for a further 24-30 
hrs at 37°C. Supernatants were prepared by 

centrif ugation for use directly in the ELISA. 
20 1) The plates (Falcon microtest III flexible plate) 
were coated with 200ul per well of lmg/ml hen egg 
lysozyme in 50mM NaHC03, pH 9.6 overnight at room 
temperature . 

2 ) The wells were rinsed with three washes of phosphate 
25 buffered saline ( P3S ) , and blocked with 300ul per well 2% 
skimmed milk powder in PBS for two hours at 37 °C. 
3 ) The wells were rinsed with three washes of PES and 
200ul of culture supernatant were added and incubated for 
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two hours at room temperature. 

, j ihree times with 0.05% Tween 

4) The wells were washed v.hree 

20 in PBS and three times in PBS . 

5) 200,1 of a suitable dilution (1 in 1000, of rah,, 
poXyclonal antiserum against the Pv fragment in 2% 

incubated for two hours. 

6) washes were repeated as in (4). 

7 , 2 00UX- of.. agitable., dilution (1 in 5000 ) of goat 

an ti-rabbit antibody (Sigma) coupled to 
peroxid.se, in 2% — — 

to each well and incubated at room temperature for one 



hour . 

8) Washes were repeated as in (4). 
15 9) 



20 



25 



„, aoci 2.2. ...w.-""-'- 

tS lB n,„ <0. 5 5» S /»1 in 

L* —r— «~ 5 « PlUS 4 "" ^ 

P- •«—'• '"" h 1P1 30S hyat0 °* n 

temperature . _ 
' T he reaction was stopped by adding 0.05% sod.um 

at 4C5nm in a TiterteX Multis.an M.C. to give an optical 

* nr each weU. The optical density 
density reaamg xor eacn 

reading is In proportion to th. amount and aff^ty or 
t h e primary antibody used in. the E,XSA, in this case a 
single domain antibody. 
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The result shown in figure 6 demons tr aires that these 
mutants had retained their ability to bind lysozyme. The 
VHThy-1 mutant appears to have higher affinity /quantity 
of lysozyme binding activity than the parent VH D1.3, 
5 whereas mutant VHThy-2 has slightly less. 

Given the complexity of the problem (see above), 
this result was surprising and reinforces the point that 
the properties of VH single domain antibodies can 
surprisingly be improved without unduly compromising 
10 antigen binding, as taught in this document. 

EXAMPLE 13 ASSESSMENT OF ANTIGEN BINDING STATUS OF 
VHMUTTRP, VHMUTLEU AND VHTHY-3 MUTANT SINGLE DOMAIN 
ANTIBODIES 

VHD1.3 interface mutants VHMutLeu , VHMutTrp and 
VHThy-3 constructed as described in examples 3,7 and 9 
respectively were assessed for lysozyme binding 
activity., - Antigen binding status of mutant single 
binding domains was determined by ELISA as described in 
example 12. The result shown in figure 7 demonstrates 
that these mutants had retained their ability to bind 
lysozyme. 

EXAMPLE 14 ASSESSMENT OF ANTIGEN BINDING STATUS OF VHTHY- 
1, THY- 2; VHTHY-1, THY- 3 AND VHMUTWD MUTANT SINGLE DOMAIN 
ANTIBODIES 

25 VHD1.3 interface mutants VHThy-1 , Thy-2; VHThy-1, 

Thy-3 and VHMutWD constructed as described in examples 10 
and 4 respectively were assessed for lysozyme binding 
activity. Antigen binding status of mutant single 
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bi nding drains was determined by ELISA as described in 
examole 12. The result shown in figure 8 demonstrates 
that these mutants had retained their ability to bind 
lvsozyme . 

5 ' Thus, extensive changes (6 amino acid substitutions) 
can be made at the VH interface increasing the polarity 
of the domain without affecting the ability to bind 
lyso2 yme. The approach used to select substitutions 
^ugh-t-^n-thi-s-document-may-be-expe^a to be applicable 



10 to any VH domain. 

f^o^/Bs FOR DT SPTAY ON PHAGE 

The gene encoding the derivative VHThy-1, Thy-2, 
generated in example 10 by using the DNA sequence of the 

X5 mutant protein VHThy-1 as template and the .utant 
olig onucleotide VHThy-2 to incorporate site directed 
changes, was subcloned in.o the vector fdPs/Bs for- 
display of this VH domain on phage as a fusion with gene 
HI protein. The vector fdPs/Bs is similar to fdCATl 

20 (Mccafferty, J. et al, 1990 Nature 348 P 552-554, except 
that it contains Pstl and BstBll restriction sites for 

cloning ( Figure 9 ) . 

A minioreo of pUC119 VHThy-1 , Thy-2 DNA was prepared 
using standard procedures (Sa^brooK et al, 1989 supra). 
25 The VHThy-1, Thy-2 encoding sequences were amplified by 
PCR using the primers RVHTKYFOR and KS J6 . 
RVHTHYFOR 

5 ■ TGA GGA GAC GGT GAC CGT GGT GCC TTG GCC AGT G 3 • 
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This incorporates a BstEll site at the 3* end of the 

VHThy-1, Thy- 2 gene. 

KSJ6 

5' AGG TGC AGC TGC AGG AGT CAG G 3' 
5 This incorporates a PstI site at the 5' end of the VHThy- 
1, Thy- 2 gene. 

PCR was performed using 20mM Tris (pH7.3 at 70*C), 
50mM KC1, 4mM MgCl 2 , 0.01% gelatin with lOuM each 
oligonucleotide, lmM each dNTP, 5 units Taq polymerase 

10 and approximately 50ng pUC119VHThy-l , Thy-2 DNA in a 
total volume of lOOul . The product of the PCR reaction 
was ethanol precipitated and resuspended in 20ul lOmM 
Tris, pH8.0, 0.1 mM EDTA, A lOul portion was digested 
using PstI (20 units) and BstEll (20 units) in NEB buffer 

15 2 in a total volume of 50ul at 37 °C for 2h (restriction 
enzymes obtained from New England Biolabs, CP Labs, 
Bishops Stortford). Following digestion, the reaction 
mixture was phenol extracted and ethanol precipitated. 
The product was electrophoresed on a 1% agarose Tris- 

20 acetate-EDTA gel and the band of approximately 3 50bp 
excised and the DNA purified using Geneclean (Bio 101, La 
Jolla, California). Vector DNA ( f dPs/Bs RF form) was 
prepared using standard procedures (Sambrook et al, 1989 
supra). This DNA ( 1 . 2ug ) was digested with PstI and 

25 BstEll (50 units) in lOOul NEB buffer 3 at 37°C for 90 
min. The products were phencl extracted and ethanol 
precipitated and the resuspended DNA was phosphatased as 
described by Sambrook et al, (1989 supra). A preparative 
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0 7% Tris-borate-EDTA agarose gel was pe.forn.ed and the 
ba nd of approximately 9Kb exceed and the DNA purified 
using Geneclean and suspended in 10 .1 10-. Tris, 
P H8.0, 0.1-i EDTA. Ligation was performed using 5ul each 
5 of tne digested vector and insert DNA using 200 units of 
T4 DNA Xigase in 10ul NEB ligase buffer. Tne ligation 
mix ture (8ul) was transformed into content E.colx 

_= arcordin a to Sambrook et al, 1989 
MC1061 cells preparea according 

t nn 2YT aaar containing 
su^a )— and-the- mixture -pla-ted_on„ 2YT a , 



10 



15 



20 



25 



supra ; 1 

20M /«1 tetracycline. Colonies were picked, single 
sanded DNA was prepared < SambrooK et al, 1989 supra) 
and the DNA was sequenced using a Seguenase 2.0 *xt 
(United Sates Biochemical, Cleveland, U.S.A. The 
seqU ence of the insert corresponded to VHThy-1, Thy-2. 
The derivative has been named fdVHThy-1, Thy-2. 

A clone of VHD1.3 in fdPs/Bs was prepared starting 
from PUC119VHD1.3. The insert encoding VHD1.3 was 
pr eoared by digestion of pSWl-VHDl . 3-TAG1 (Ward E.S. et 
al " 1989 suora) with P.tl and BstEll. Other procedures 
weT -e as above. This derivative has been named fdVHD1.3. 

. s- ~j IS , ASSAY ™ THY-2 DO MAIN DISPLAYED 

EXAMPLE 16 cLISA Abb>A* ur 

ON PHAGE 

Th e fdVHThy-1, Thy-2 phage constructed in example 1= 
was shown to be functional in the binding of the antigen, 
lysozvme, using an ELISA assay. 

V<rsl oarzicles were prepared by growing E.col, 
MC1061 cells containing fdVHThy-1, Thy-2; fdVHD1.3; phage 
an tibodv D1.3 (displaying scFvD1.3; McCafferty, 3. et al. 
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1990 Nature 348 p552-554) or fdPs/Bs in 50ml 2YT medium 
containing 15ug/ml tetracycline for 16 to 24h. The 
culture supernatant was collected by centrif ugation for 
10 min at lOOOOrpm in an 8 x 50ml rotor. Phage particles 
5 were precipitated by adding 1/5 volume 20% polyethylene 
glycol (FEG)/2.5M NaCl and leaving at 4°C for lh. Phage 
particles were pelleted by centrif ugation for 15 min as 
described above and the pellets resuspended in sterile 
lOmM Tris, pH8 . 0 ImM EDTA containing 1% gelatin to 1/40 
10 the of the original volume. 

1. EL.ISA plates were coated with lysozyme and blocked 
with PBS containing skimmed milk powder as described in 
example 12. 

2. Wells wer--* rinsed with PBS. 

15 3. Concentrated phage ( 200ul ) was added to each well as 
appropriate and incubated at room temperature for 2h. 

4. The wells- were washed three times with 0.5% Tween 20 
in PBS and three times with PBS. 

5. Sheep anti-M13 serum (200pl; 1 in 1000) in PBS 
20 containing 2% skimmed milk powder was added to each well 

and incubated for lh. 

6. Washing was repeated as in 4. 

7. Peroxidase conjugated rabbit anti-goat 
immunoglobulin (200ui; 1 in 5000; Sigma) was added and 

25 incubated for lh. 

8. Washes were repeated as in 4. 

9. Peroxidase substrate was added as in example 12 and 
colour allowed to develop for lh. 
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Both faVHThy-1. Thy-2 and fdVHDl.3 gave ELISA 
sig nals 4 to 5 ti.es above the value obtained with 
fd P S/ Bs, whereas with phage antibody OX. 3, the signal was 
proximately 8 ti.es that with fdPs/Bs (Figure 10). 
Thu . modification ox' the VH interface residues does not 
affect the ability of the domain to bind lysozyme when 

displayed on phage. 

'it will be appreciated by those skilled in the art 
that the present invention - has bee_n described, above by 
way of example only, and that considerable modifications 
to the procedure may be made to effect a similar outcome 
without departing fro- the scope of the invention. 



SUBSTITUTE SHE 



WO 92/01787 



PCT/GB91/01253 



CLAIMS 

1. A single chain variable domain, which is a 
synthetic analogue of another single chain variable 

5 domain of a member of an immunoglobulin family or 
superfamily, and in which analogue, one or more interface 
amino acid residues of the domain is altered as compared 
to the said another domain, in which a said altered amino 
acid is substituted with a residue which occurs in an 
10 analogous position in a member of an immunoglobulin 
family or superfamily, so that the analogue is more 
hydrophilic than the said another domain. 

2. A single chain variable domain according to 
claim 1 in which a said altered amino acid residue is in 

15 a framework region. 

3. A single chain variable domain according to 
claim 1 or claim 2 in which a said altered amino acid 
residue is in a complementarity determining region. 

4 . A single chain variable domain according to any 
20 one of claims 1 to 3 wherein the synthetic analogue has 

essentially the same binding activity as the said another 
domain . 

5 . A single chain variable domain according to 
claim 2 or claim 3 in which the amino acid sequence of a 

25 complementarity determining region is additionally 
altered by way of amino acid substitution, deletion, 



SUBSTITUTE SHEET 



PCI7GB91/01253 

WO 92/01787 

38 

addition, or inversion, to alter the specificity and/or 
binding characteristics of the analogue as compare, to 

the natural domain. 

6 A single chain variable domain according to any 
5 one of claims 1 to 5, which is a synthetic analogue of a 

single variable immunoglobulin heavy chain domain. 

7 A single chain variable domain according to 

claim 6, in which one or more of the amino acid residues 

37 39, 45, 47, 91, 93 and 103 is altered. 

10 8 a single chain variable domain according to 

comprise one or more of the following: 

i, substitution of valine 37 with glutamine or 

" threonine ; 

15 ii) substitution of glutamine 39 with glutamate; 

iii) substitution of leucine 45 with glutamine; 
1V ) substitution of tryptophan 47 with aspartate or- 

glycine; 

. ^ rvrosine 91 with threonine, 

v) substitution of tyrosine 

serine or methionine; 
vi) substitution of alanine 93 with serine or 

glutamate ; 

vii) substitution of tryptophan 103 -with glutamate 
-tyrosine or threonine; 
viii) substitution of valine 37, leucine 45, 
tryptophan 47, alanine 93 and/cr tryptophan 103 



20 
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with any of asparagine, threonine or serine; 
ix) substitution of valine 37 with threonine and 
glutamine 39 with glutamate and tryptophan 47 
with glycine; • 

5 x) substitution of tyrosine 91 with serine or 

methionine and alanine 93 with glutamate and 
tryptophan 103 with threonine. 
.9 . A single chain variable domain according to any 
one of claims 1 to 8 which is coupled to a further 
10 molecular moiety. 

10. An immunoglobulin single chain variable domain 
according to claim 9 wherein the further molecular moiety 
is an enzymic-, fluorescent-, or. radio-label, or a 
portion of an immunoglobulin. 
X5 11. A diagnostic kit which comprises a single chain 

variable domain according to any one of claims 1 to 10, 
together with one or more ancillary reagents for carrying 
out the diagnostic test. 

12. A therapeutic composition which comprises at 
20 least a single chain variable domain according to any one 

of claims 1 to 10. 

13 . A method for making a single chain variable 
domain which is a hydrophilic synthetic analogue of 
another single chain variable domain of a member of an 

25 immunoglobulin family or superfamily, which comprises: 

(i) inspecting the interface regions of a said single 
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chain variable domain to identify hydrophobic amino acid 
residues ; and 

producing a said anaXogue of said single chain 
variable domain in (i) in which one or more of saxd 
S hydrophobic residues is substituted with a less 
Hydrophobic residue which occurs in an analogous position 
i„ a member of an immunoglobulin family or superfamily. 

14. A method according to claim 13 which comprises: 
( a, obtaining the nucleotide sequence encoding one.. 
lQ or more of the identified hydrophobic amino 

acid residues; 
'(to) using site directed mutagenesis to alter the 
nucleotide sequence to introduce a triplet 
coding for the substitute amino acid, 
15 (c) using the altered nucleotide sequence in a 

recombinant expression system to express the 
synthetic analogue. 
15 A method according to claim 13 or claim 14 
Mherei n more than one amino acid residue is substituted. 

16. A method according to any one of claims 13 to 
15 wherein the substitute amino acids are derived from 

. +-H(= immunoglobulin 

naturally monomeric members of the 

superfamily- 

17. A method according to claim 16 wherein the 
25 naturally monomeric member is Thy-1. 

18 . A method according to any one of claims 13 to 
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17 wherein the synthetic analogue has essentially the 
same binding activity as the said another domain. 
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GCflTGCRflflTTCTflTTTCflR_GG^6.flC fiCT Cfl TflflTGR flfi TfiC CTR TT GCC TRC GG CflG CC 
10 20 30 -10 50 60 

PelB leader 

S G L 1 LLPROPfi"* Q U Q L Q E S 
GCTGGRTTGTTflTTflCTCGCTGCCCflflCCRGCGRTGGCCCRGGTGCflGCTGCflG.GRCTCR 

70 80 90 100 HO 120 

Pat I 

GPGLURPSqSLSITCTUSGF 
GGRCCTGGCCTGGTGGCGCCCTCflCRGRGCCTGTCCRTCflCflTGCRCCGTCTCflGGGTTC 
130 no 150 160 170 180 

SLTGVGUHUURQPPGKGLEU 
TCRTTRRCCGGCTRTGGTGTflRflCTGGGTTCGCCRGCCTCCRGGRRRGGGTCTGGRGTGG 
190 200 210 220 230 240 

UHD1 .3 

LG n IUGOGHTDVHSRLKSRL 
CTGGGRflTGRTTTGGGGTGflTGGflflflCRCflGRCTRTRflTTCRGCTCTCRRRTCCRGRCTG 
250 260 270 280 290 300 

S i SKDNSKSQUFLKHNSLHT 
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3!0 320 330 340 350 360 

OOTRR-VVCHRERDVRLDVUG 
GflTGflCflCRGCCRGGTRCTfiCTGTGCCRGRGflGflGfiGflTTflTflGGCTTGflCTflCTGGGGC 
370 380 390 400 110 420 

QGTTLTUSS 
CRflGGCRCCRCTCTCRCRGTCTCCTCRTflflTRRGflGCTR 
«30 440 450 



Rg I. 



WO 92/01787 



zj/o 



PCT/GB91/01253 



a 
> 



a 

c 



4) 
u 

a 

s 







< 




cj 




cj 




< 




t 1 








p~ 




h- 




f- 




O 




< 












CJ 




< 




< 








CJ 




O 




o 




E- 




CJ 




o 
















O 




f— 




CJ 




CJ 




f— 


f<*> 


r 


f . 

t 




E— 


CJ 


CJ 


CJ 


cj 


CJ 






< 




CJ 


< 


< 


CJ 


CJ 


cj 


cj 


f— 


' E— 


CJ 


CJ 


< 


cj 


CJ 


E- 




< 


a 




< 


< 


CJ 


cj 




CJ 


CJ 


cj 


(- 


{— 


{— 




< 


< 


(J 


cj 


f— 




< 


< 


< 


< 


< 


< 


CJ 


CJ 







CJ 

E- 
t- 

E- 
CJ 

CJ 
CJ 

< 

& 

CJ 



< 

CJ 

(J 

CD 
< 



< 
CJ 

cj 
cj 

< 
< 

o 
cj 
o 
cj| 

E— 

CJ 

t3 
O 
< 

o 
o 



< 

cj 
cj 
< 
cj 
< 



< 

CJ 

CJ 



o 
o 
p 
G 

o 
o 
< 
o 



m 
E- 
o 
t— 

tj 

8 

f- 
cj 

< 
E- 
O 

< 

o 

in < 

cj 

2 2 

r o 
> o 

to. E- 
* CJ 

1 B 

*■* E- 

« CJ 

T CJ 
— • E— 

< 

< 
E— 
< 



8 

m 

■ 



u 

■ 



cj 

• cj 

< 
< 

cj 
(— 

< 

i— 

a 

< 
cj 
cj 

CJ 
CJ 

C3 
E— 
E— 

CJ 
CJ 
O 

f— 

O 



to 
< 

E- 
CJ 
CJ 

o 

< 
< 

CJ 

< 

CJ 

CJ 
O 

o 
E- 

E— 

(-> 

f— 

o 



< 
(-) 

< 
< 

E— 
O 

< 

I 

a 
o 



o 

E— 

o 



< 

E— 

O 

< 

< 
< 

<l 

o 
f- 

CJ 
E— 
cj 
r— 
CJ 



f- 
O 
i — 

CJ 

O 

E— 

<J 

CJ 

< 
f- 

61 

< 

CJ 

< 
CJ 

o 

r- 

CJ 

f- 

CJ 
r- 



| < s 



o 5 « < 3 



■ — I • o c 

H- P- t- ?. E- — 

^3 



-J 



LI ed 



X O 



3 



^ 7 >- ? j ^ > ^ 

f «-i IT Prt T "T" 



WO 92/01787 



3/| 0 PCT/GB91/01253 



e 

s 



8 

I 



ti 
w 
0 

s 
y 



CO 




< 


< 










< 


< 








j< 
<l 




f— 


; — 




f— 


i- 






< 


< 












-f— 1 




51 




< 
















CJl 


t— 






<_> 






(J 




< 


< 












{— 










5- 




(— 


* 


(- 
<_> 


f— 

v 


CJ 








< 


< 






< 


< 










f- 








< 




w 








0 


0 


< 


< 




<-> 






(_> 




(— 






B- 


< 


< 






f- 


E- 


< 


< 


< 


< 


< 


< 






in 


in 



1 

e 



6 

I 



u 

1/1 



r- 
r— 

o 

E— 

< 

{— 
o 

< 

F- 
< 
f- 

f- 

f- 

t-J 

f- 

0 



< 
< 



(J 
o 
< 
< 

f- 
< 

O 
< 

CJ 
o 
o 



o 
o 



f- 

o 

t- 

<-> 

< 

t— 

-<- 
< 

a 

r- 

s- 
t-J 

t— 
{— 

r- 

< 

< 



o 

< 
< 

< 

< 

o 



O 

in 



en 

b 

<-> 
o 
o 

(- 

< 
o 

H 
< 
M 

-<- 
< 

£1 



WO 92/01787 



PCT/GB91/01253 



o O ^ 



_ 



o O 



© = 8 

in * 5 



o O 



t5 



- ^ 8 



3*g 



o Q 



2 " 



v a 



a 
ss 
> 



v 
u 
e 
u 

3 

V 

03 



V 

u 
a 
v 

« 



0 



o 
c 

-H 
0 



« 

o 
a 

-H 

a 



■8 



<0 -E 



O SM a 

<* H g 



15 = 



S > 



s < 



2 o 



(— f 
fa 



a u 

-u o 



c 

a 4) 

4J O 

X C 



E 

5 



5 



WO 92/01787 



5-/10 



PCT/GB91/01253 



ro 
< 

< 
< 

< 



X 

o 
o 

o 



- < 



r- 

n < 

2 *- 

9 x 

« O 

C ° 

O < 

La ° 



eft 
O 

< 
5 



< 
< 

E— 
x 

o 



< 
< 

< 
< 

(J 
t- 
o 

< 
t- 

X 

(_) 
(J 

o 

.- « o 



m 
u 

9 

'55 

8 



c 

o 

t 
ft 

d 



WO 92/01787 k/lO PCT/GB9 1/0 1253 



id 



m i r 

2r > 

Q ft p 

n= x: x 

> > > 




WO 92/01787 



^flo PCT/GB91/01253 



ELISA of mutant VH domains 



in 
© 



4t 

O 



o 




pUCI 19 
VHD 1 .3 
VHNutTrp 
VHMutLeu 
VHThy-3 



0.0 



WO 92/01787 



PCT/GB91/01253 




WO 92/01787 



PCT/GB91/01253 



&eneHT 

CJ624) T _ (I650 ' 

£±t*.t: TCT CAC TCC G CT . GAA AC i Git GAA AG . 



Q V Q L Q 



V T V 5 5 



1^ P./fl, ™ CAC TCC T GCT cIg GTC CAA CTGCAG AAG CTT AC 6 GTC aCC GTC TCC TCA ACT GTT GAA ACT 



VH 



PsLl 



BstEl 1 



SUBSTITUTE SHEET 



WO 92/01787 



PCT/GB91/01253 



ELISA of fdVHThy-I,Thy-2 and controls 






^fcRNATIONAL SEARCH REPORT^ 

iBtmdocol Apoiicaribo^TO 


PCT/GB 91/01253 


L CLASSIFICATION OF SUBJECT MATTER Of several dissificarioe, symbols apply, indicate ail)* 


According to International Patent deflation (IPC) or to both National OusfianoD ud IPC 

Int. CI. 5 C12N15/12; C12N15/13; C07K13/00; 


A61K37/02 


a. FIELDS SEARCHED 


Minimun Documentation Searched" 1 


Cbusiflcatton System 


Classification Symbols 




Int. CI. 5 


C07K 



Documentation Searched other thin Minimum Documentation 
to the Extent that such Documents are lnduded ia the Fidds Searched 





Cteirioa of Document, " with indication. where appropriate, of the relevant passages * | Wevant t0 CUia ^ 


X 
Y 


EP,A,0 368 684 (MEDICAL RESEARCH COUNCIL) 16 May 
1990 

See page 5, lines 19-32; page 6; page 7, lines 
1-23; page 13, lines 3-30; examples 7-12; 
claims; figs. 10,12 


1-16,18 
17 


X 
Y 


WO, A, 8 809 344 (CREATIVE BIOMOLECULES, INC) 1 
December 1988 

See page 22, lines 5-32; pages 23,24,29; pages 
30-57; claims 


1-6, 
9-16,18 

17 


Y 


SCIENCE. 

vol. 216, 14 May 1982, LANCASTER, PA US 
pages 696 - 703; 

WILLIAMS, A. F. ET AL: "Neuronal Cell Thy-1 
glycoprotein: Homology with Immunoglobulin 1 
cited 1n the application 
see the whole document 

-/— 


17 



" Special categories of dted documents : 10 

*A" document defining the general state of the art which is not 

considered to he of particular relevance 
*E" earlier document but published on or after the international 

filini ■»*• 

*L* document which may throw doubts on priority daim(j) or 
which Is ched to establish the publication date of another 
citation or other special reason (as specified) 

'O* document referring to an oral disclosure, use, exhibition or 
other means 

*P* doemment published prior to the international filing date but 
later than the priority date daiaed 



or priority date tad not la conflict wtth the apeucarioo but 
died to understand the prindple or theory underlying the 
ta veriri on 

~X' document of particular relevance; the daimed invention 
cannot he considered novel or cannot he considered to 
involve an inventive step 

*Y r document of particular relevance; the daiaed invention 

cannot be considered to involve as inventive step when the 
document is combined with one or more other such «ocu- 
ments, such combination being obvious to a person tallied 
ha the art. 

'A' document member of the same patent family 



Dit< of the Actual Completion of the international Search 

14 NOVEMBER 1991 


2 2. 11. 91 


International Searching Authority 

EUROPEAN PATENT OFFICE 


Signature of Authorized Officer 

NAUCHE S - A -_^A^ 


3s*j 





i PCT/ISA/210 | 



International Appli 



sm^^No 



PCT/GB 91/01253 



m. DOCUMENTS CONSIDERED TO BE RELEVANT (CONTINUED FROM THE SECOND SHEET) 



Category 0 



Citation of Document, with indication, where appropriate, of the relevant passages 



Relevant to Claim No. 



NATURE. 

vol. 341, 12 October 1989, LONDON GB 
pages 544 - 546; 

WARD, E.S. ET AL. : 'Binding activities of a 
repertoire of single immunoglobulin variable 
domains secreted from Esherichia Col i . 1 
cited in the application 
see the whole document 

JOURNAL OF MOLECULAR BIOLOGY 

vol 204, no. 1, 5 November 1988, ACADEMIC 

PRESS, H. B. JOVANICH, PUBL 

pages 155 - 164; 

JANIN, J. ETAL.: 'Surface, subunit interfaces 
and interior of oligomeric proteins. 1 
see the whole document 

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES 
OF USA. 

vol. 82, July 1985, WASHINGTON US 
pages 4592 - 4596; 

N0V0TNY, J. ET AL. : 'Structural invariants of 
antigen binding : comparison of immunoglobulin 
VL-VH AND VL-VL domain dimers. * 



1-18 



1-18 



1-18 



Fan PCT/1SAJ210 (exrr» (J»*u-y l«J| 



ANNEX TO THE INTERNATIONAL SEARCH REPORT 
ON INTERNATIONAL PATENT APPLICATION NO. GB 

SA 



9101253 
49593 



Thim annex fats the patent family umiibufc rcUcinc to me patent documents cited in the above-mesa ooed mternadoaai search report. 
The BMdKn arc as contained in me European Patent Office EDP file on .. /1WQ1 
The European Patent Office is in bo way liable for these particular* which are merely given for the purpose of mrormatjoc. IS-/ 11/21 



Patent document 
cited in search report 



Publication 
date 



Patent faoaly 
member's) 



Pubticnooa 
date 



EP-A-0368684 



16-05-90 



AU-A- 
WO-A- 



4520189 
9005144 



28-05-90 
17-05-90 







JP-T- 


3502801 


27-06-91 


WO-A-8809344 


01-12-88 


AU-B- 


612370 


11-07-91 




AU-A- 


1804988 


21-12-88 






EP-A- 


0318554 


07-06-89 






JP-T-" 


2500329 


-" 08-02-90 - 



For more details about this annex : see Official Journal of the European Patent Office, No. 12/82 



XP 002046115 cm**/,* cX%TJ/?6 

A SHORT POLYPEPTIDE MARKER SEQUENCE 
USEFUL FOR RECOMBINANT PROTEIN <M * ^ 
IDENTIFICATION AND PURIFICATION p -*~ — 



Thomas P. Hopp, Kathryn S. Prickett, Virginia L. Price, Randell T. Libby 1 , Carl J. 
March, Douglas Pat Cerxetti, David L. Urdal and Paul J. Conlon 

Immune* Corporation. 51 University Street. Seattle. Washington 98 101. 'Present address: Departments of Genetics. SK-50. and 
Pathulogy. SM-JU. University of Washington. Seattle. WA 98195. 



A small hydrophilic peptide of eight ami- 
no acids (AspTyrLvsAspAspAspAspLys) 
was engineered onto the N-terminus of a 
variety of recombinant lymphokines for 
the purpose of aiding in their detection 
and purification from yeast supernatants 
or E. colt extracts. An antibody specific for 
the first four amino acids of this sequence 
was used as a detection reagent and for 
immunoaffinity purification of products 
under mild conditions. Because of the 
small size of the peptide moiety and its 
hydrophilic nature, the proteins were un- 
affected by its presence and retained a 
high level of biological activity. In addi- 
tion, it was possible to remove the peptide 
via an enzymatic cleavage procedure us- 
ing enterokinase. 

Among the many approaches taken to improve 
the yield and purity of recombinant proteins, 
one particularlv useful procedure is to express 
the desired polypeptide as part of a larger 
fusion protein 1 •'-*. Fusion to a protein-export signal se- 
quence has been used to cause secretion of products from 
yeast J and £. coti* cells. Furthermore, it has recently been 
recognized that an attached fusion polypeptide sequence 
might serve as an aid to identifying or purifying the 
product. For example, in several cases the added polvpep- 
tide segment contains a complete second protein thai 
binds to affinity columns via its specific substrate or 
ligand. These include (3-galactosidase fusion proteins that 
bind to aminophenvlihiogalactosidyl Sepharose columns 3 
and protein A fusion proteins that bind to immunoglob- 
ulin columns". Such fusion proteins can be highly purified 
in good vields in a single step bv passing cell extracts or 
supernatants over columns of an appropriate affinitv 
matrix, then eluting the purified fusion protein by chang- 
ing conditions so that binding is no longer possible. A 
related approach is to use an antibodv directed against the 
added sequence as a detection or affinity purification 
reagent 7 -* although the high binding , affinity of 
most antiscra and monoclonal antibodies often requires 
the use of denaturing conditions for elution of the prod- 
uct. 

The fusion protein approach as several drawbacks that 
have not been adequately addressed in the past. First, 
most fusion protein products fail to fold properlv into a 
native, active state"'. It is possible that the added polypep- 
tide segment is responsible for this misfolding due to 



unfavorable interactions during folding of the fusion 
protein. This often necessitates treatment with strong 
denaturants such as 8M urea and 7M guanidine HCl 
followed by refolding procedures 1 l 

A further problem with fusion proteins is that it is often 
difficult or impossible to remove the additional amino- or 
carboxyl-terminal sequence from the desired protein 
product. One solution has been to use relatively drastic 
conditions and chemical cleavage agents such as CNBr in 
70^ formic acid*-' 3 or low pH incubations 13 to effect 
cleavage. However, recent studies have attempted to use 
somewhat milder chemical cleavages such as hvdroxvl- 
amine treatment at pH 9.0 14 or enzymatic cleavage proce- 
dures under physiological conditions. The blood clotting 
factor X, has a proteolytic specificity for the tetrapeptide 
sequence IleCluGlyArg, and has been used to liberate (3- 
globin from a kcll protein fusion sequence 10 . A sequence 
allowing specific cleavage by collagenase has also been 
proposed 5 . Sassenfeld and Brewer 11 developed a facilitat- 
ed ion-exchange purification technique by fusing their 
proteins to a C-terminal series of arginine residues that 
are subsequently removed by carboxypeptidase B treat- 
ment. These enzymatic processes have been successful in 
several instances, but often have been limited bv poor 
cleavage yields or by unwanted cleavages that occur within 
the desired protein sequence 10 . 

We decided to create a recombinant protein detection 
and purification system that incorporated several aspects 
of the above mentioned procedures in order to produce a 
fusion sequence with a combination of the most desirable 
properties. Here we report the development of a short. N- 
terminal fusion sequence AspTyrLysAsp AspAspAspLvs. 
that we refer to as a marker sequence or "Flag'*" segment 
for antibody mediated identification and purification of 
recombinant proteins. We also describe a monoclonal 
antibody that reacts with this sequence and can be used as 
an immuno-affintty purification reagent that purifies 
marker fusion proteins under very mild conditions. Final- 
ly, the marker sequence can be removed by treatment with 
the protease, enterokinase. which is specific for the five C 
terminal amino acids of the marker sequence' 6 . No harsh 
treatments are required at any step in this process and 
proteins purified by this approach retained their biologi- 
cal activity throughout the purification, even while the 
marker sequence was attached. This paper describes the 
expression of several such fusion proteins in Saccharomxcei 
crrevisiae and Escherichia coli. 

RESULTS 

In order to develop this system, we performed a series 
of interrelated steps. The eight amino acid marker pep- 
tide was engineered onto the N-terminus of the tvmpho- 
kine interleukin 2 (IL-'J)^ by means of synthetic oligonu- 
cleotides. The fusion protein was expressed in veast and 
the product purified by conventional means, then used as 
an immunogen to produce a monoclonal antibody HE I I | 



specific for the marker sequence. The antibodv func- 
tioned as a reageni for a number of diflerent immuno- 
chemical procedures including "Westerns." "dot blots, 
immunoprecipitaiions, and affinity purification when 
coupled to a solid support. Furthermore, the discovery 
thai the 4E11 antibody would release its antigen when 
calcium was removed from the medium led to the devel- 
opment of a mild purification procedure for fusion pro- 



tein elution from the affinity columns. Next, treatment of 
the fusion proteins with enterokinase demonstrated that 
the enzvme was capable of removing the marker segment 
efficiently, with little or no observable degradation of the 
desired protein product. Finally, measurements of specif- 
ic activity demonstrated that, for all proteins tested, no 
appreciable loss of activity was caused by the presence of 
the marker segment on the N-termini of the recombinant 
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nwH 1 PUsmid pIXY8. for expression of the IL-2 fusion 
protein in veast. The synthetic oligonucleotides used in con- 
structing this plasmid extended from the Kpnl site at i he left 
of the sequence shown below the plasmid diagram, to a blunt 
end at the HgiAI site near the right side, ending with the 
codon for the C-terminal Ivs residue of the marker peptide 
segment. The (L-2 coding sequence extended from a blunt 

Hind III 



end before the first codon of IL-2 (Ala) to a StuI site beyond 
the termination codon. The arrows below the amino acid 
sequence indicate the sites of cleavage by the KEX2 protease 
to remove the a-factor precursor sequence from the primary 
translation product, and by enterokinase. to remove the 
marker peptide from the product protein. 
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3 Plasmid pRL6-6-37. for expression of the GM-CSF 
fusion protein in £. colt. Abbreviations: lpp p . lipoprotein 
promoter; Uuf°. lactose promoter-operator region; amp. A sig- 
nal, outer membrane protein A signal peptide sequence: GM- 



CSF. human granulocyte-macrophage colony stimulating fac- 
tor; Ipp'". lipoprotein terminator; tatf*. lactose repressor; on. 
origin of replication. 
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products. 

Secretion of fusion proceins from yeast. Figure I shows 
the plasmid pl\Y3, used for production of the IL-'J 
fusion protein in yeasi. Similar constructs with the ADH2 
promoter replacing the a-factor promoter were used to 
express granulocyte colony stimulating faccor (G-CSF)"*. 
interteukin 3 (IL-3)'". interleukin 4 (IL-4)-" and granulo- 
cyte-macrophage colonv stimulating factor {GM-CSF)- 1 
fusion proteins. All of these proteins were secreted into 
culture media bv yeast. Each had the expected molecular 
weight for the correctly processed form (i.e. with leader 
peptide removed) and yielded the expected sequence of 
the marker peptide on N-terminal amino acid sequence 
analysis. The IL-2 fusion protein was purified by HPLC 
! for use as an immunogen. while the other fusion proteins 
were purified bv the 4EI I antibody immunoaffinity chro- 
matography procedure (below). For comparison, essen- 
tially identical vectors were prepared that expressed each 
protein without the marker peptide. These products were 
recovered from the yeast culture media and purified to 
homogeneity by conventional techniques including HPLC 
and ion exchange chromatography-"-'. 

Expression of CM-CSF fusion protein in E. coli. The 
construction of the plasmid pRL6-6-87 for expression and 
secretion of the CM-CSF fusion protein in E. coli is 
outlined in Figure 2. This plasmid allows the secretion of 
the marker peptide CM-CSF fusion protein by means of 
the signal peptide from the outer membrane protein 
OmpA. The product obtained from E. coli cultures had 
the expected molecular weight for the marker peptide 
GM-CSF fusion and yielded an N-terminal amino acid 
sequence corresponding to the marker peptide sequence. 

The marker-specific antibody. The isotype of the 4EI I 
antibody is IgC 2B. It was found to be reactive with 
proteins bearing the marker peptide sequence in a variety 
of procedures, including ELISAs. dot blots. Western blots, 
immunoprecipitation and affinity chromatography, as de- 
scribed below. The antibody was found to react with all of 
the marker peptide fusion proteins that we have pro- 



duced. The antibody exhibits no reactivity with the non- 
marker products, or any component present in E. coli 
extracts, or in yeast culture medium. 

Purification. Figure 3 shows the results of typical 
affinity purification chromatograms on affinity columns 
made with the 4E11 antibody. In Figure 3A. a culture 
supernatant obtained by fermenting yeast bearing the 
CM-CSF expression vector was passed over the column to 
purify the fusion protein that had been secreted into the 
medium. Medium components were removed by washing 
with PBS containing 0.5 mM CaCh. while the CM-CSF 
remained bound to the antibody. Subsequent elution with 
PBS containing EDTA dissociated the marker p^ptide- 
antibody complex and released the CM-CSF fusion pro- 
tein as a purified product. The multiple molecular weight 
species eluting from the column are typical of glycopro- 
teins secreted from yeast and result from heterogeneous 
glycosylation by yeast cells. All bands were identified as 
GM-CSF. based on Western analyses using 4E1 1 as well as 
ami GM-CSF monoclonal antibody as developing re- 
agents. 

Figure 3B shows the results of affinity chromatography 
of an extract of £. colt cells that had been transformed 
with pRL6-6-87 in order to produce the GM-CSF fusion 
protein. Chromatography was carried out as with the 
yeast GM-CSF fusion, except that 1 mM Ca~ 2 was used 
during washing and 0.1 M glycine HCI pH 3.0 buffer was 
used to elute the product. The GM-CSF fusion protein 
eluted as a single molecular weight species because £. coli 
does not glycosylate proteins. The product was essentially 
pure after this single chromatographic step. 

The binding of the 4E1I antibody to the marker 
peptide is dependent on the presence of calcium. This 
property has been reported for a few other antibodies in 
the past'*' 3 . We observed that if insufficient levels of 
calcium were present in washing buffer, then the fusion 
proteins would leak from the affinity column, even 
though they had bound quantitatively when yeast super- 
natant or E. coli extract was passing over the column. Bv 




nCVIf 3 Affinity purification of recombinant fusion proteins. 
Panel A. Sikrr-suincd polvacrvlamide gel of veasi CM-CSF 
fusion protein purification. Lanes are: !: molecular weight 
standards: *J: veast supernatant: How through material: 4- 
6: sequential PB5'0.5 rnM CaCI : washes; 7-|">: sequential 
PBS'- U mM ED TA eluttons: lti: molecular weight standards 
Panel B: SiUer-Mamed ge! of E. cult CM-CSF fusion prmeu; 



purification. Lanes arc: 1: molecular weight standards: "1: E. 
colt supernatant; 3: How through material: -i-6: sequential 
PBS'0.5 mM CaCI; washes: 7-11: sequential 0.1 M Cly-HCl 
pH 'S elutions: 1'J: molecular weight standards. Numbers at 
right indicate M f values for the standard proteins (in kiloDai- 
tons). The l JA K standard w;is omitted from the nel shown in 
panel A. 



testing the effect of various concentrations of CaCl; in 
washing buffer, we determined that concentrations of 
calcium above 0.3 mM were necessary to retain the fusion 
proteins on the column. Given this calcium dependence, it 
was found that rapid elution of fusion proteins could be 
achieved by using EDTA in the elution buffer. It was also 
possible to eluie proteins simply by using a calcium free 
elution buffer after the columns had been washed with 
PBS containing calcium. However, under these conditions 
the eluted fusion protein tended to spread through more 
fractions than when the elution buffer contained EDTA. 

Enterokinase treatment. Figure 4 shows the results of 
enterokinase treatment of the IL-2 fusion protein. In- 
creasing amounts of enterokinase were added to identical 
aliquots of the IL-2 fusion protein, then the samples were 
incubated for 16 hr at 37°. As the concentration of enzyme 
was increased, a component appeared at the molecular 
weight of the authentic IL-2. At the highest concentra- 
tions of enzyme, the conversion of fusion protein to 
authentic protein was complete, and an approximately 
identical amount of authentic product had been formed. 
Western blot analyses using 4E1 1 and anti-IL-2 monoclo- 
nal antibodies confirmed the identities of the products 
seen on the silver gel. Only the higher molecular weight 
-species-was. reactive_with„4El.l ._but.as_expected_boih_the_ 
higher and lower molecular weight species were reactive 
with the anti-IL-2 antibody. Amino terminal sequence 
analysis on the resulting cleavage products indicated that 
the IL-2 fusion protein was cleaved after the second lysine 
of the marker peptide yielding the expected sequence for 
the authentic IL-2 N-ierminus. As can be seen in Figure 4. 
the cleavage is complete, and in this case results in only 
small amounts of detectable lower molecular weight by- 
products formed by enterokinase cleavage within the IL-2 
sequence. Essentiallv the same cleavage pattern was ob- 
tained with the other fusions as well. In no case were 
degradation products present in quantities greater than 
10% of total protein (determined by sequence analysis). 

Retention of biological activity. The marker peptide 
fusion proteins expressed in yeast and E. coii were all 
biologically active despite the presence of the marker 
segment, as is seen in Table 1. In all cases the levels of 
biological activity obtained with the fusion proteins were 
comparable to wild-type recombinant proteins expressed 
without the marker sequence. In the case of GM-CSF the 
specific activity values in Table 1 were obtained before 
and after removal of the marker sequence by enteroki- 
nase. The nearly identical specific activities obtained indi- 
cate that the vield of cleaved product is probablv near 
100%. 

DISCUSSION 

The marker peptide fusion system described in this 
report comprises a unique and widely useful technique 
for protein identification and purification. In addition, 
two observations have shown that the marker peptide is 
compatible with heterologous expression systems: one. 
both the £. coii OmpA signal and the yeast pre-pro alpha 
factor leader sequences, when fused to the marker pep- 
tide, are correctly processed by their respective proteases 
allowing secretion of marker peptide-protein fusions with 
the correct N-ierrninus. And second, because many inves- 
tigators have reported problems in the N-terminal proc- 
essing of microbially expressed mammalian proteins'* 4- "*' 0 , 
the ability of the marker peptide to protect the N- 
terminus of the desired product may be another impor- 
tant feature of this system. 

The capability to produce authentic N-termini upon 
enterokinase treatment is an improvement over a number 
of other fusion protein approaches. Those that require 



chemical cleavages using Asp- Pro 1 - or Asn-Gly' 5 directed 
reagents, for example, must necessarily leave a proline or 
glycine at the N-terminus of the product. Although we 
have not yet tested the ability of enterokinase to cleave the 
marker sequence from N-termini containing all of the 20 
possible amino acids, we have found that it is capable of 
cleaving products with N-terminal Glu, Ala.Thr. Leu and 
lie residues. This suggests that this procedure will be 
useful for a wide range of N-termini including charged 
and uncharged, hydrophobic and hydrophilic residues. 

There are several requirements that should be met bv 
an efficient detection and purification system based upon 
fusion polypeptide expression: First, the added marker 
segment should not interfere with the native folding of 
proteins to which it is attached. Second, the marker 
peptide sequence should be intrinsically water soluble and 
should retain a high degree of exposure in the aqueous 
environment of the protein, so that it can readily interact 
with the affinity purification substrate. Third, it should be 
useful in an affinity purification step that requires only 
very mild media, and be elutable with a non-denaturing 
and inexpensive eluant. Finally, the marker, peptide 
should be easy to remove and the product protein should 
not have any amino acids added or deleted once the 
..mar ke r peptide has been removed. The Flag"* pept ide 
fusion system was designed to possess all of these proper- 
ties, and our data with several recombinant proteins 
suggest that it may prove to be a universal purification 
system for proteins expressed in heterologous organisms. 

Several factors were considered in choosing the specific 
sequence of the marker peptide moiety. We chose to limit 
the marker peptide sequence to only eight amino acids 
because it can easily be encoded in a single synthetic 
oligonucleotide, and because the longest trypsinogen pro- 
sequences are of this length. We therefore could be 
reasonably sure that the trypsin-activating enzyme, en- 
terokinase. would work efficiently to release the peptide. 
Additionally, because antibodies require up to six or seven 
amino acids for avid binding interactions, we reasoned 
that eight amino acids should be the minimum sequence 
capable of strong binding to an antibody while allowing 
one or more of the last amino acids on the C-terminal end 
to act as a spacer to separate the antibody binding portion 
from interference with the bulk of the protein. Finally, the 
five C-terminal amino acids of the marker sequence 
represent the minimal enterokinase specificity site. 
AspAspAspAspLvs. 

The choice of Lys at position three of the. marker 



TilU 1 Expression level and specific activity of marker fusion 
proteins. 



Protein 



Specific Activity (U/mg x 10"*) 
With Marker Without Marker 5 



IL-2 
G-CSF 
IL-3 
1L-4 



1-3 
10-20 
15-20 



CM-CSP 1 15-20 



Expressed in Yeast 
240 - 20 
63 - 10 
49 - 19 
5.4 r 1.6 
Expressed in £. coii 
150 r 60 



280 _ 40 
12 r 1.0 
42 - 12 

N.D. C 

140 - 60 



'Determined by dot blot assay usint; 4E1 I antibody to detect 
marker peptide containing material. 

8 Proteins in this column (except £. coii GM-CSF) were pro- 
duced without the marker segment and purified by conven- 
tional means. 
"Not determined. 

d ln this case, specific activities were determined on the same 
sample inrfore and after enterokinase treatment to remove the 
marker segment. 
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fWOtf 4 Enterokinase digestions. The IL-2 fusion protein was 
incubated with increasing amounts of bovine enterokinase 
and the digestion was analyzed by silver-staining PAGE. Lanes 
are: 1: purified IL-2 fusion protein (200 ng) no enzvme: 2: 
fusion protein i- 2 ng enterokinase: 3: fusion protein - 10 nn 
enterokinase; 4: fusion protein - 20 ng enterokinase. 
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SEQUENCE POSITION 
I 5 Hydrophilicity plot of the IL-2 fusion protein. The 
profile was generated using the updated HYDR03 program 
of Hopp 19 . The scale is oriented so that hydrophilic is at top: 
hydrophobic is at bottom. Valley regions are expected to be 
buried portions of the polypeptide, whereas peaks are expect- 
ed to be exposed at the surface of the protein. The prominent 
peak near the N-terminus results from the extremelv hvdru- 
philic hexapepiide, LysAspAspAspAspLys, contained within 
the marker peptide sequence. 



sequence causes the marker peptide to contain the hexa- 
pepiide sequence. LysAspAspAspAspLys. that has a maxi- 
mum value on the hydrophilicity scale of Hopp and 
Woods 27 . Such maximally hydrophilic sequences have 
been proven to express strong antigenicity and are corre- 
spondingly likeh to adopt a highly exposed conformation 
in the three dimensional folding of a protein'-*. As can be 
seen in Figure 5. it is impossible for any other region of a 
protein co have a higher hydrophilicity value than this 
maximally hydrophilic sequence, so the marker segment is 
virtually guaranteed to be exposed at the surface of any 
fusion protein. Therefore it can always be expected to be 
available for binding to antibody. Perhaps most impor- 
tantly, the strong predilection for externalizaiion should 
guarantee that the marker segment will not interfere in 
the adoption of a native conformation by the remainder 
of the protein. 

In addition to the hydrophilic effects of Lys at position 
three, several other considerations influenced the choice 
of amino acids at the N-terminus of the marker peptide. 
Aromatic amino acids have been recognized as major 
factors in antigen-antibody interactions 29 so a tyrosine was 
placed at position 2, flanked by charged amino acids. 
Recent evidence suggests that aromatic residues that are 
flanked by charged sequences are more likely to be 
involved in antigenic sites than are other aromatic resi- 
dues in less polar environments'-' 8 . The decision to place 
an Asp residue at the N-terminus was made in pan 
because the negative charge on the Asp should aid in 
exposing the Tyr to antibody, as mentioned above, and in 
part because, with the inclusion of the Asp at position 1. a 
total of eight charges are to be found on the marker 
peptide moiety, including the N-terminal amino group. 
This preponderance of charged residues was expected to 
make it likely that antibody binding would be heavily 
dependent on charge-charge interactions, and therefore 
might be highly susceptible to elution with commonlv 
used salt solutions such as 2 M MgCl- or 1 M NaCI 9 . In the 
end, the serendipitous discovery that Ca"" was involved in 
the charge-dependent binding of the marker sequence to 
the 4E1I antibody made even these mild salt treatments 
unnecessary. 

We have seen that fusion proteins retain the appropri- 
ate specific activity even with the marker segment still 
attached, and that this activity can be maintained after 
enterokinase treatment to remove the marker sequence. 
Comparisons of several of these fusion protein products 
with their natural counterparts (Table 1) demonstrated 
that the presence of the marker did not decrease the 
specific activity of the fusion proteins relative to the same 
proteins with no extraneous amino acids added. We have 
recently begun using a larger version of the 4E 1 I column 
to prepare proteins in milligram quantities. This level of 
scale-up required no special procedures or equipment, 
and can still be done as a bench top experiment. Further 
scale up for production of gram or kilogram quantities is 
contemplated, and will be limited only by availability of 
antibodv and enterokinase. One advantage of the 4E1 I 
antibody is that it can be purified on an affinity column 
comprised of chemically synthesized marker peptide at- 
tached to a solid support and eluted with EDTA. 

One area that remains potentially problematic is the. 
provision of adequate enterokinase for this process, both 
in terms of quantitv and quality. We sampled commercial 
sources nf enterokinase. but found that our fusion pro- 
teins were digested into small fragments, presumably by 
contaminating chymotrypsin. trypsin and elastase that arc 
likely to be present in these partially purified prepara- 
tions. Our own crude bovine intestinal preparations also 
caused substantial unwanted hvdrolysis. until we used the 



protocol of Liepnieks and Light" 1 for removing these 
other proteases. The fact that a minor amount of unwant- 
ed cleavage is seen in some cases when the marker peptide 
is removed (Fig. 4) suggests that traces of contaminating 
proteolytic activity may still be present in our purified 
preparations of enterokinase. An ideal solution would be 
to clone enterokinase and express it in a recombinant 
organism. This would provide starting material with 
much lower levels of contaminating proteases, and much 
higher levels of enterokinase. Furthermore, with the 
enterokinase gene in hand, it might be possible to engi- 
neer a smaller form of the molecule, lacking the hvdro- 
phobic portion that binds it to the membranes of the 
intestinal villi. This would simplify purification and de- 
crease the mass of enterokinase needed for cleavage of the ! 
marker segment. 

Despite the need for further development of enteroki- < 
nase. this system for fusion protein detection and puriticu- < 
lion already represents a useful technique, h offers the j 
possibility of using a single procedure for the purification ; 



of multiple fusion proteins. Although it is also possible to 
purtfv fusion proteins from veast medium or E. cod cells 
bv conventional means such an ion exchange or reverse 
phase chromatography, these procedures require new 
method development for each new protein, whereas with 
marker fusions, the same process is applicable to all 
proteins. Finally, because the marker segment does not 
appear to have decreased the biological activity of any of 
the proteins that it has been placed on. it may not always 
be necessarv to remove the marker segment in order to 
obtain an active product. In such cases, this useful "han- 
dle" can be retained on the molecule, enabling investiga- 
tors to readily detect and manipulate their recombinant 
protein products. 

EXPERIMENTAL PROTOCOL 

Yeast plasmid construction. The yeast vector used tor protein 
expression has been described previously " . This vector contains 
sequences from pBR322 that allow selection iAmp r > and replica- 
tion in £. colt, as well as the yeast Trpl gene and 2u, origin of 
replication for selection and autonomous replication in a irpl 
yeast strain. Expression of foreign genes is under control of the 
a- factor promoter 1 or the ADH2 promoter" and secretion is 
directed ov the a-facior leader peptide. To generate the IL-2 
expression vector pIXYS. the mature coding region of lL-2'~ was 
fused in-frame to the marker peptide and the a- factor leader by 
. means of a synthetic . oligonucleotide, linker encoding the five C- 
icrminal amino acids of the a-factor leader and the eight amino 
acids of the marker peptide (Fig. I). The vectors that directed the 
expression of the other products were generated by two modifi- 
cations of pIXYS. First, the a- factor promoter was removed by 
restricting the plasmid with EcoRI and Pstl. then inserting the 
ADH2 promoter using a synthetic oligonucleotide linker. Sec- 
ond, the marker and appropriate proiein coding sequences were 
added in place of the IL-2 sequence (Fig. I) and linked with a 
synthetic oligonucleotide that extended to (he Hpal site. 

Growth of yeast strains. S. cerrvuiae sirain XV2 1 8/fa/a-trp- 1) 
was grown in either selective medium (YNB ~trp. consisting of 
0.67% Yeast Nitrogen Base (Difco). 0.59c Casamino acids. 2*3 
glucose, 10 u.g/ml adenine and 20 u-g'ml uracil] or rich medium 
(YPD. consisting of \9c Yeast Extract. 2^t peptone and 19£ 
glucose supplemented with 80 M-g-'ml adenine and 80 ixg'ml 
uracil). Yeast transformations were done by selecting for Trp" 
transformants". Cultures were grown for biological assay by 
inoculating 20-50 ml of rich medium with the appropriate strain 
and growing the cultures at 30°C to stationary phase. Cells were 
then removed by centrifugation and the medium was filtered 
through a 0.45u. cellulose acetate filter. Sterile supernatcs were 
stored at 4°C. Larger scale fermentations were done in a 10 liter 
New Brunswick Microferm fermentor. Cells were removed from 
the medium using a Millipore Petlicon filtration system. 

Construction of the £. coli vector. Plasmid pIN-III-OmpA) is 
an £. colt expression vector regulated by the tandem Ipp" (lipopro- 
tein promoterj/Zac 1 * 0 (lactose-promoter-operator) that contains a 
synthetic ompA leader sequence for protein secretion 14 . Construc- 
tion of the GM-CSF expression vector was accomplished by 
restriction digestion at the unique Bam HI site of plN-UI- 
OmpA), foilow-ed by its conversion to blunt ends by treatment 
with reverse transcriptase (Boehringer-Mannheim). The vector 
was subsequently restricted with Hind I II, and used in a three- 
way ligation with a synthetic oligonucleotide encoding the marker 
peptide sequence and cDN'A encoding GM-CSF to produce 
pRL6-6-87 as outlined in Figure 2. 

Growth of £. coti. Plasmid pRLrt-6-S7 was introduced bv 
transformation into £. eoti strain JM 107. (A/nr. pro. tht. urA. tnd.\. 
ibcBl5. hspR-i/F', traD36. proAB ' . iac/ q -ZAM 15) which was grown 
at 37°C in M9 minimal medium containing IVc (w/v) methionine 
assay medium (DIFCO) and ampicillin (50 p.g/ml) to an OD NW of 
0.25. Cultures were induced following addition of isopropvl-Q-u- 
thiogalactopvranoside (IPTG) and evelic 3'-5* adenosine mono- 
phosphate (cA.MP) to 2mM and 4m.M, respectively, and allowed 
to accumulate the GM-CSF fusion protein for 2-4 hr. Cells were 
harvested by centrifugation, and pellets either stored at - 70°C or 
I directlv processed for extraction and purification of marker 
I fusion proteins. £. coii pellets were extracted hy the following 
j procedure: the pellet from 500 ml of culture was suspended in 5u 
j ml of 150 m.M NaCI. 50 m.M NaH : PO a . pK3.4. to which I mM 
phenvlmethvlsulfonylfluoride had been added immediatelv prior 
! to mixing. After freezing I -70 J C» and thawing three times to Use 
> the £. rati cells the sample w-js incubated at 37 a C for 30 niin to 



complete lysis and to extract the protein product. The viscous 
extract was treated bv dounce homogenization to achieve a 
uniform solution, then centrifuged at 25.000 g for 45 min at 4 3 C. 
The supernatant was adjusted to 0.5 mM CaCb. recemrifuged if 
necessary to remove any resulting precipitate, then applied to the 
affinity coiumn. 

Preparation of immunogens. Palmitic acid conjugated pep- 
tides were produced by solid phase chemical synthesis as de- 
scribed previously 1 *. The antigenic marker peptide had the 
sequence AspTyrLysAspAspAspAspLysGly ProLysLysGly to 
which palmitovl moieties had been attached on the epsiion amino 
groups of the two C-ierminal lysines. It is referred to as CDP- 
marker (C-ierminal dipalmitoyl marker peptide). A second pal- 
mitovl peptide. NDPCM I (N-terminal dipalmitoyl GM-CSF pep- 
tide 1)" was used as- a non-specific binding control. It had the 
structure LvsClyGlyGluSerPheLysGlu AsnLeuLys AspPhcLeu- 
ValGly. and also possessed two palmitoyl moieties, in this case 
attached to the two amino groups of the N-ierminat lysine 
residue. For purification of the IL-2 marker peptide fusion 
proiein. supernatants of yeast expressing the EL-2 fusion protein 
were applied to a reverse phase HPLC column. The IL-2 fusion 
product was eluted from the column using a gradient of acetoni- 
trile. as described previously 2 *. 

Immunization. B.ALB/c female mice were purchased from the 
Jackson Laboratories (Bar Harbor. ME) ana maintained in our 
animal facility. Mice were immunized subcutaneously with 250 
ixe of tL-2 fusion proiein emulsified in Freund's complete 
adjuvant, followed with 125 u-g of the same protein emulsified in 
Freund's. incomplete, adjuvant four weeks later. Two weeks after 
the second inoculation, a serum antibody titer to lL-2 and the 
marker peptide was measured by "dot-blot". The animal was then 
challenged with 10 u,g of protein intravenously four days prior to 
fusion. 

Hybridoma derivation. Four days after the intravenous boost, 
the animals were sacrificed, their spleens removed, and a single 
cell suspension prepared. The splenocytcs were fused to the 
HAT sensitive myeloma cell. NS-1. The resulting hybridomas 
were then assayed for the production of antibodies to the marker 
peptide seven to ten days later by ELISA (see below). One 
hybridoma antibody consistently produced a positive reaction 
specifically with the marker peptide moiety. This cell line, desig- 
nated 4E11. was then cloned by limiting dilution, isotyped. and 
further characterized. 

EUSA. Various peptide solutions (CDP-marker. or NDP 
CM I) were applied to HA plates (Millipore. Bedford. MA) at a 
concentration of 40 ng per well and allowed to incubate for 30 
min at room temperature. Nonspecific protein binding sites were 
blocked by an incubation with 37c bovine serum albumin in Tris 
buffered saline. pH 7.0 (TBSA) for I hr at room temperature. 
Hybridoma supernatants were added and the plates incubated 
for I hr. Following this incubation, the plates were washed with 
PBS and an alkaline phosphatase labeled goat anti-mouse anti- 
body (Sigma Chemical, St. Louis. MO) was added. Following a I 
hr incubation, the plates were washed several times with PBS and 
a coiorimetric indicating reagent was added i substrate tablets. 
Sigma Chemical). Contents of each HA plate were then trans- 
ferred to a polystyrene 96 well plate (Linbro/Titertek. Flow 
Laboratories. McLean. VA) and the absorbance at 405 nm 
determined on a Titcrscan (Flow Laboratories). 

Production and purification of 4EI 1 antibody. Prisiane- 
primed BALB'c mice were injected IP with 1*10 hybridoma 
cells. Ten to twenty days later, the ascitic fluid was recovered, 
centrifuged at 1000 * g for 30 minutes at 4*C. passed through 
cotton gauze, and the supernatant stored at -20 a C until needed. 
The monoclonal antibody 4E11 was purified from ascites fluid 
using MAPS II Protein A Affigcl (Bio-Rad. Richmond. CA) 
affinity chromatography. The purified antibodv was found to be 
homogeneous bv SDS-PAGE analysis. 

4E1 1 column pre para uon. Purified 4E1 1 immunoglobulin was 
concentrated bv ultrafiltration. After dialysis against 0.1 M Hepes 
butfer. pH 7.5 at 4°C the antibody was coupled to AfFigel- 10 < Bio- 
Rad) in accordance with the manufacturer s instructions. A 
typical antibody-coupled gel contained from 1.5 to 4 5 me 
amibodv/ml of gel. Columns of 4E1 1 coupled gel of 1.5 ml bed 
volume were prepared in polvpropvlene columns (Bio-Rad) and 
washed with 15 ml PBS. 15 ml 0. 1M glycine HCI. pH 3.0. and 
stored at 4 J C in PBS ; 0.02^ sodium azide. 

4EII column chromatography. Yeast culture filtrates were 
brought to physiological levels of salt and pH bv adding I OX PBS. 
and marie 0.5 mM in CaCU by adding I M CaCl : . and then loaded 
onto the 1.5 ml column of 4E t 1 coupled Affigel 10 under gravitv 
How. £. cut extracts did not require anv further additions because 1 



the extraction medium contained ph vsioioijical levels of salt and 
pH as well as 0.5'mM Ca(JI : . Up to I UO mi of filtrate were passed 
over the column, depending on the level ot expression of the 
recombinant protein. After loading the column was washed with 
three to five aliquots of 3 ml of PBS containing 0.3 m.M CaCU. 
Eluiion was carried out with PBS lacking ('aCl ; and containing 
2.U m.M Na : EDTA or with 0. 1 M glycine HCI pH 3.0. Each 
elution fraction was I ml. Yields of purified proteins were 
determined hv amino acid analysis, and were tvpicalh 15 — 109£ ot 
the theoretical maximum assuming a 2:1 antigen to antibod\ 
binding ratio. 

Enterokinase treatment. Emeroktnase was purified from bo- 
vine intestine by the procedure of Liepnieks and Light m . Samples 
were also provided by A. Light of Purdue University. For 
enterokinase treatment, fusion proteins eluted from the antibody 
column were made IU mM in Tris-HCI (pH 8) and adjusted to 
pH o\0 by addition of IN NaOH. For certain samples, the 
reaction mixture was made 40 m.M in octyl-(3-o-glucoside. Follow- 
ing the addition of an appropriate amount of bovine enteroki- 
nase (1-lO^c by weight", typtcallv 0.2-2% by molarity), the reac- 
tion mixture was incubated for 16 hours at 37T.. Enterokinase 
dilutions were made from a 1 mg/ml stock solution of enzvme in 
10 m.M Tris-HCI. pH S kept at -70°C. 

Bioassavs. The activity of IL-2 was measured using the murine 
IL-2 dependent T-cel! line CTLL-2^. The activity of GM-C5F 
was measured in a human bone marrow proliferation assav 37 and 
the activity of [L-3 was measured by FDC-P2 cell proliferation 1 *. 
IL-4. and G-CSF were assayed as described ,l * "'. Specific activities 
were derived by measuring the biological activities of purified 
samples of each protein, a fter quantifying bv amino jcid analysis. 
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